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Let G be an afline group scheme defined over a field k, and denote by Rep k G the 
category of finite dimensional representations of G over k. The principle of tannakian 
duality states that any neutral tannakian category is tensorially equivalent to Rep fc G 
for some afline group scheme G and field k, and conversely. 

Originally motivated by an attempt to find a first-order explanation for generic 
cohomology of algebraic groups, we study neutral tannakian categories as abstract 
first-order structures and, in particular, ultraproducts of them. One of the main the- 
orems of this dissertation is that certain naturally definable subcategories of these 
ultraproducts are themselves neutral tannakian categories, hence tensorially equiva- 
lent to ComodA for some Hopf algebra A over a field k. We are able to give a fairly 
tidy description of the representing Hopf algebras of these categories, and explicitly 
compute them in several examples. The work done in this vein constitutes roughly 
half of this dissertation. 

The second half is much less abstract in nature, as we turn our attention to 
working out the representation theories of certain unipotent algebraic groups, namely 
the additive group G a and the Heisenberg group Hi. The results we obtain for 
these groups in characteristic zero are not at all new or surprising, but in positive 
characteristic they perhaps are. In both cases we obtain that, for a given dimension 

iii 



n, if p is large enough with respect to n, all n-dimensional modules for these groups 
in characteristic p are given by commuting products of representations, with the 
constituent factors resembling representations of the same group in characteristic 
zero. This has led us to define the 'height-restricted ultraproduct' of the categories 
Rep k _G for a sequence of fields hi of increasing positive characteristic, and the above 
result can be summarized by saying that these height-restricted ultraproducts are 
tensorially equivalent to Rep fc G n , where G n denotes a direct product of copies of G 
and k is a certain field of characteristic zero. We later use these results to extrapolate 
some generic cohomology results for these particular unipotent groups. 
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Chapter 1 
Introduction 



Consider the following two theorems: 

Theorem 1.0.1. (see Corollary 3.4 of Let G be a simple, simply connected al- 
gebraic group defined and split over¥ p , and A a dominant weight. If p is sufficiently 
large with respect to G, A and n, then the dimension of H n (G(¥ p ), S(X)) is indepen- 
dent of p. 

Theorem 1.0.2. Let be a first-order statement in the language of fields such that 
is true for every characteristic zero field. Then is true for all fields of sufficiently 
large positive characteristic. 

The first is a classic generic cohomology theorem; if you can assume such and such 
a thing to be large (in this case, characteristic), cohomology stabilizes. The second is 
a textbook exercise in model theory, an easy consequence of the compactness theorem 
for first-order logic. The analogy between these two statements has been the broad 
motivation for the following: is there a first-order explanation for the phenomenon of 
generic cohomology? 

Our investigations into this question have, as fate would have it, led us far astray 
from our original objective. The majority of this dissertation is devoted to the study 
of neutral tannakian categories as abstract first-order structures (roughly speaking, 
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the categories which can in some sense be thought of as Rep fc G for some affine group 
scheme G and field k), and in particular, ultraproducts of them. To this end we 
identify certain subcategories of these ultraproducts which themselves are neutral 
tannakian categories, hence tensorially equivalent to Comod^ for some Hopf algebra 
A. We are able to provide a general formula for A, and explicitly compute it in several 
examples. 

For the remainder we turn away from ultraproducts, and instead to the study of 
the concrete representation theories of certain unipotent algebraic groups, namely the 
additive group G a and the Heisenberg group H\. For both groups we obtain a certain 
'generic representation theory' result: that while the characteristic p > and charac- 
teristic zero theories of both can, by and large, be expected to bear little resemblance 
to one another, if instead one is content to keep positive characteristic large with 
respect to dimension, there is in fact a very strong correspondence between the two. 
These results are later codified by considering the 'height-restricted ultraproduct' of 
these groups for increasing characteristic, and from them we are able to generate some 
modest, 'height-restricted' generic cohomology results for these groups. 



1.1 Preliminaries 

For an algebraic group G defined over Z and a field k, Rep fc G is the category of 
finite dimensional representations of G over k. This category is tensorially equivalent 
to ComodAigifc, where A is the representing Hopf algebra of G over Z, and we generally 
prefer to think of it as the latter. If ki is a collection of fields indexed by I and U a non- 
principal ultrafilter over J, we consider the ultraproduct of the categories Comod^ 
with respect to U, with A4 = A <g> k{, which we denote as nwComod^. 

The language over which these categories are realized as first-order structures, 



which we call the 'language of abelian tensor categories' (section 4.1), includes sym 
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bols denoting an element being an object or morphism, composition of morphisms, 
addition of morphisms, morphisms pointing from one object to another, and notably, 
a symbol for the tensor product (of objects and morphisms). It also includes symbols 
denoting certain natural transformations on the category, necessary to describe cer- 
tain regularity properties of the tensor product, e.g. being naturally associative and 
commutative. The primary reason we have chosen these symbols is 

Theorem 1.1.1. (see chapter^ In the language of abelian tensor categories, the 
statement "is a tannakian category" is a first-order sentence. 

Chapter [3] is devoted to giving an explicit definition of a tannakian category. 
Suffice it to say for the moment, it is an abelian category C, endowed with a bifunctor 
®:CxC->C, which satisfies a plethora of regularity conditions, e.g. being naturally 
associative and possessing internal Homs. We say that a tannakian category C is 



neutral (see definition 3.2.6) if it comes equipped with a fibre functor, i.e. an exact, 
faithful, fc-linear tensor preserving functor u from C to Vec/c (the category of finite 
dimensional vector spaces over k, where k is the field Endc(l), and 1 denotes the 
identity object of C). The motivation for the definition of a neutral tannakian category 
is the following theorem. 

Theorem 1.1.2. (see theorem 2.11 of fljjj) Let C be a neutral tannakian category over 
the field k with fibre functor u. Then 

1. The functor Aut®(u) on k-algebras is representable by an affine group scheme 
G 

2. oj defines an equivalence of tensor categories between C and Rep k G 

The moral: a neutral tannakian category is (tensorially equivalent to) the cate- 
gory of finite dimensional representations of an affine group scheme over a field, and 



vice versa. (Section 3.3 is devoted to describing how one goes about, in principle, 



recovering the representing Hopf algebra of a neutral tannakian category.) 



1.2 The Restricted Ultraproduct of Neutral Tan- 
nakian Categories 



As is argued in section 15.1 the basic concepts of cohomology of modules (at least 
in the case of Ext 1 ) are quite naturally expressible in the language of abelian tensor 
categories. Therefore, to study cohomology over a particular group G and field k, 
a reasonable object of study is the category Rep fc G as a first-order structure in this 
language. But further, we are interested in studying generic cohomology; that is, 
for a fixed group G and sequence of fields ki, we would like to know if a particular 
cohomological computation eventually stabilizes for large enough i. We are then 
drawn to the study of not the single category Rep fc G for fixed k, but rather the infinite 
sequence of the categories Rep fc .G. And as ultraproducts of relational structures, by 
design, tend to preserve only those first-order properties which are true 'almost all of 
the time', it is for this reason that we have chosen to study ultraproducts of categories 
of the form Rep fc .G, which we denote by J7 M Rep fc .G. 

While being a tannakian category is a first-order concept, the property of being 
endowed with a fibre functor, so far as we can tell, is not. If Cj is a sequence of 
tannakian categories neutralized by the fibre functors Ui, the natural attempt to 
endow Y\ u Ci with a fibre functor would go as follows. Define a functor uj on Y\ u Ci 
which takes an object [Xj\ £ Y\ u ^i ^° EL^f^) (ultraproduct of vector spaces; see 
section 6.2 ), and similarly for a morphism [0j] (ultraproduct of linear maps; see section 



6.2.1 ). But this will not do; oj([X;]) will in general be infinite dimensional (proposition 



6.2.4), specifically disallowed by the definition of a fibre functor. Further, for any 



collection of vector spaces Vi and Wi over the fields ki, we have a natural injective 



ma P Ylu^i ® rL^ 7 * ~~ Ylu^i ® Wi (section 6.2.2). But unless at least one of the 
collections is boundedly finite dimensional, this will not be an isomorphism; thus u 
will not be tensor preserving in general. We therefore make the following compromise: 
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Definition 1.2.1. The restricted ultraproduct of the Cj, denoted [] fi Cj, is the 
full subcategory of Y\ u Ci consisting of those objects [Xj\ such that the dimension of 
u>i(Xi) is bounded. 

Then we indeed have 



Theorem 1.2.1. (see theorems 7.2.3 and 7.2.4) Yl R ^i ^ s a tannakian category, neu 



tralized over the field k = Y[ u h by the functor u described above. 

Thus, Y\ R Ci is tensorially equivalent to Comodyi^ for some Hopf algebra over 
the field k = The question then: what is A^l 

The obvious first guess, that it is the ultraproduct of the Hopf algebras A; rep- 
resenting each of the C«, is not correct; problem being, this is not a Hopf algebra at 
all. We start by defining a map A on by the formula n«^i EL^* ® ^ 

(the ultraproduct of the maps A$). But again, unless the are boundedly finite 
dimensional, we cannot expect this A to point to riw^i <S> c Ylu ^ ® ^ * n 

general. So we make another compromise: 



Definition 1.2.2. (see section 9.1) The restricted ultraproduct of the Hopf alge- 
bras Ai, denoted Ar, is the collection of all [aj G riw^ such that the rank of is 
bounded. 

Defining exactly what "rank" means here takes some doing, so we defer it; suffice 
it to say, Ar can indeed be given the structure of a coalgebra, under the definition of 
A given above. We are able to prove [9] 

Theorem 1.2.2. The representing Hopf algebra of the restricted ultraproduct of the 
categories ComodAi is isomorphic to the restricted ultraproduct of the Hopf algebras 
A,. 



In section 9.5 we explicitly work out Aoo for a few examples. If G is a finite 



jroup defined over Z with representing Hopf algebra A, and if ki is any collection of 



fields, then the Hopf algebras A® k{ are constantly finite dimensional, whence the full 
ultraproduct Y\ U A (g> k^ is in fact a Hopf algebra. In this case can be identified 
with A ® n w ^., whence Y\ R Rep kx G ~ Rep^G. 

For non-finite groups, the situation becomes considerably more delicate. As an 



example, consider the multiplicative group G = G m (subsection 9.5.2) and let hi be 
any collection of fields. For a fixed ultrafilter U, let Ylu^ denote the ultrapower of 
the integers. Then we can identify A^ as the k = Y\ u ki-span of the formal symbols 
x^ Zi \ [zi] E Yl u Z, with A and mult defined by 

A^ = span fc (:r w : [ Zi \ E Y[Z) 

u 

A : x [z ' ] i-> x [Zl] <8> x [Zl] 
mult : x W ®x [m] ^ x [Zi+Wi] 

We also note here that chapter [8] contains an interesting theorem about finite 
dimensional subcoalgebras of Hopf algebras which was necessary to prove theorem 



1.2.2, but is certainly of interest in its own right, and requires no understanding of 



ultraproducts. 

1.3 From Ultraproducts to Generic Cohomology 

The reason we chose to study these categories in the first place is because coho- 
mology of modules (at least in the Ext 1 case) is a naturally expressible concept in the 



language of abelian tensor categories. That is (see section 15.1) 

Proposition 1.3.1. For fixed n, the statement <p(M ', N) = "Ext 1 (M, N) has dimen- 
sion n" is a first- order formula in the language of abelian tensor categories. 

Here we have adopted the view that Ext 1 (M, N), relative to a given abelian cat- 
egory, consists of equivalence classes of module extensions of M by N, as opposed to 



the more standard definition via injective or projective resolutions; necessary, since 
the category Rep fc G will in general not have enough injective or projective objects 
(due to it consisting of only finite dimensional representations of G over k). Suppose 
then that M and N are modules for G over Z, and that fcj is a collection of fields. 
We wish to discover whether the quantity 



dimExt^ (fci) (M,iV) 
stabilizes for large i. We have a criterion for this to be true. 

Theorem 1.3.2. Let Mi and Ni denote the images inside Rep k .(G) of the modules 
M and N , and let [Mi], [Ni] denote the images of the tuples (Mi), (Ni) inside the 
category \\ R Rep k .(G). Then if the computation dimExt^i\M^\, [Ni]) is both finite and 
the same inside the category Y[ R Rep ki (G) for every choice of non-principal ultrafilter, 
the computation dim Ext l G ^(Mi, Ni) is the same for all but finitely many i. 

Proof. The key fact (and the reason we restrict to Ext 1 in the first place) is that 
computing Ext 1 in the restricted ultracategory Yl R Ci is the same as doing so in the 
full ultracategory [] M Cj, since the extension module of any 1-fold extension of [Mi] by 
[N^ has bounded dimension dim(Mj) +dim(iVj). First-order statements that are true 
in J} w Rep A .(Cr) for every choice of non-principal ultrafilter correspond to statements 
that are true for all but finitely many of the categories n M Rep fc .(G), namely the 
statement "dim Ext^^M;, Ni) =n". □ 

Our attempts to extend these results to the case of Ext n , n > 1, have so far met 



with resistance; for more on this see section 15.3 
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1.4 Generic Representation Theory of Unipotent 
Groups 



Beginning in chapter 10 we take a break from working with ultraproducts, and 
instead focus on the concrete representation theory of two unipotent algebraic groups, 
both in zero and positive characteristic. Starting with the additive group G a (chapter 



12) we prove 



Theorem 1.4.1. (see theorem 12.3.6) 



1. Let k have characteristic zero. Then every n- dimensional representation of G a 
over k is given by an n x n nilpotent matrix N over k according to the formula 



e xN 



2. Let k have positive characteristic p. Then if p » n, every n-dimensional 
representation of G a over k is given by a finite ordered sequence Ni of n x n 
commuting nilpotent matrices over k according to the formula 



This is our first indication of a connection between the characteristic zero theory 
of a unipotent group and its positive characteristic theory for p large with respect to 



dimension. In chapter 13 we obtain an identical result for the Heisenberg group Hi. 



Theorem 1.4.2. (see theorems 13.5.4 an d 13.5.5) 



1. Let k have characteristic zero. Then every n-dimensional representation of Hi is 
given by a triple X,Y, Z ofnxn nilpotent matrices over k satisfying Z = [X, Y] 
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and [X, Z] = [Y, Z] = 0, according to the formula 

e xX+yY+(z-xy/2)Z 

2. Let k have positive characteristic p . Then ifp » n, every n- dimensional repre- 
sentation of Hi over k is given by a sequence X , Y Q , Z Q , X ± , Y ± , Z 1 . . . , X m , Y m , Z r 
ofnxn nilpotent matrices over k satisfying Z; L = [JQ, Yj\, [JQ, Zj\ = [Yi, Zj\ = 0, 
and whenever i ^ j , X^Y^Zi all commute with Xj, Yj, Zj, according to the for- 
mula 

e xX +yY + (z-xy/2)Z ^PX^y^ + izP-xPyP /2)Z! ^ X m +yV m Y m + (zP m -x?™ y?™ /2)Z m 

We see then that for p sufficiently larger than dimension, characteristic p represen- 
tations for these unipotent groups are simply commuting products of representations, 
each of which 'look like' a characteristic zero representation, with each factor ac- 
counting for one of its 'Frobenius layers'. It is this phenomenon to which the phrase 
'generic representation theory' in the title refers. 

Note that this is only a theorem about p » dimension; for any positive charac- 
teristic field k, once dimension becomes too large in the category Rep k H 1 , the analogy 
completely breaks down, and representations of Hi over k can be expected to bear 
no resemblance to representations in characteristic zero. 

1.5 The Height-Restricted Ultraproduct 

Let G be either of the two above discussed unipotent groups, k a field of char- 
acteristic p > 0, V a representation of G over k. Suppose that V is of the form 
described in part 2. of the two preceding theorems. Then we define the height of 
V to be m + 1, that is, the number of Frobenius layers in the representation. For 
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instance, in the case of G ai height is simply the largest m such that x pm occurs in 
the matrix formula of the representation. 

Now let ki be a sequence of fields of strictly increasing positive characteristic, and 
let Cj = Rep fcj G. For an object [Xi] of the restricted ultraproduct W R Ci (i.e. where 
the Xi have bounded dimension), by the above two theorems, for large enough i, Xi 
will be of the aforementioned form, so that for all but finitely many i, the height of 
Xi is well-defined. We therefore define the height of [Xi] as the essential supremum 
of {height(Xj) : % G I}. Note that the height of a given object [Xi] of Yl R Ci might 
well be infinite. In case it is not, we define 



Definition 1.5.1. (see definition 14.0.2) The height-restricted ultraproduct of 

the categories Ci = Rep fei G, for ki of increasing positive characteristic, is the full 
subcategory of the restricted ultraproduct Y\ R Ci consisting of those objects [Xi] of 
finite height. We denote this category by fX^Cj. For n G N, we denote by [] Jf<n C i 
the full subcategory of consisting of those objects of height no greater than n. 

We have seen already that, for p sufficiently large with respect to dimension, 
representations of G in characteristic p resemble representations of G n in characteristic 
zero. We shall also see later that this resemblance is functorial, in the sense that the 
analogy carries over to morphisms between the representations, and to various other 
constructions, e.g. direct sums and tensor products. The most compact way to express 
this is 



Theorem 1.5.1. (see theorem 14-0.6) If hi is a sequence of fields of strictly increasing 
positive characteristic, the category Y\ H Rep k G is tensorially equivalent to Rep k G°° , 
where G°° denotes a countable direct power of G, and k is the ultraproduct of the 
fields ki. Similarly, for any n G N, Yl H<n Rep k .G ^ s tensorially equivalent to Rep k G n . 

Note in particular that the group G n obtained is independent of the choice of 
non-principal ultrafilter, and while the field k does vary, it will in all cases have 
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characteristic zero. 



1.6 Height-Restricted Generic Cohomology 



This last result will allow us to derive some generic cohomology theorems for the 
unipotent algebraic groups discussed above, at least for the case of Ext 1 . Rather than 
state the theorem precisely here it will be much more illuminating to illustrate with 



an example, which is worked out in more detail in section 15.2.1 



Let G = G a and let k have characteristic p > 0. Then direct computation shows 
(using theorem 12.3.1) that a basis for ExtQ, k \(k,k) is given by the sequence of 



linearly independent extensions 



£ m : k -> 



( x xP ™\ 



\ 1 J 



k^O 



for m — 0, 1, . . .. This is obviously infinite dimensional, so we ask the more interesting 
question: what happens when we restrict the height of the extension module? Specifi- 
cally, let Extg^^/c, k) denote the space of equivalence classes of extensions whose ex- 
tension module has height no greater than h. Then of course {£ m : m = 0, 1, . . . , h— 1} 
forms a basis for it, and its dimension is h. 

Now assume k has characteristic zero and compute Extg h ( fc )(/c, k). Then similarly 



using theorems 12.2.1 and 11.0.3) we have the basis 



£ m : k -> 



1 X r 



\ 



1 ) 



-)• k 



for m — 0, 1, . . . , h — 1, where x m denotes the m th free variable of G h . Thus we see 
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that, for ki of sufficiently large positive characteristic 



dim ExtJ£ (fc0 (fci, h) = dim Ext^^^d!**. U k i 



u u 



This example is misleading in that the above equality holds for all primes p (due 
to the small dimension of the extension modules of the extensions); in general we 
merely claim that the above holds for sufficiently large characteristic. We shall prove 



in section 15.2 that this is quite a general phenomenon. 



1.7 Notational Conventions 

Throughout this dissertation our convention for expressing composition of maps 
is as follows. If <p : X — > Y and ip : Y — > Z are maps, then the composition 
X —^-t- Y — > Z shall be expressed as 



<fi o ip 

However, if x is an element of X, then the element z G Z gotten by evaluating <po ip 
at x shall be expressed as 

V>(0(z)) 

In other words, when we are expressing a composition with no inputs we shall write 
functions on the right, and when we are evaluating a composition at an input we 
shall write functions on the left. The reader need only remember that whenever he 
sees the symbol o as in cf) o we are writing functions on the right, and when he 
instead sees the parenthetical notation ip(<j)(x)) we are writing functions on the left. 
In particular, we shall never write (x)((f> o ip), (0 o ip)(x), ({x)<f>)ip, <p(ip(x)), or ip o 0. 

A matrix, if we wish to emphasize what its entries are, is generally written in the 
notation (fly), suppressing mention of its dimensions. If </> : U — » V is a linear map 
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between finite dimensional vector spaces, and if (kij) is the matrix representing in 
certain bases, then it is understood to be doing so be passing column vectors to the 
right of (kij). In particular, if <p : U — > V, if) : V — > W are linear maps, and if (k^) 
represents <fi and (Uj) represents ip, then the matrix product (hj)(kij) represents the 
map o ?f). 

If Mi is a collection of relational structures in a common signature indexed by 
the set J, and if U is a non-principal ultrafilter over /, we denote by riw^i the 
ultraproduct of these structures with respect to U. The reader may consult the 
appendix for a review of ultrafilters and ultraproducts in general. In several instances 
in this dissertation we shall be considering certain substructures of ultraproducts, 
e.g. the restricted ultraproduct of the neutral tannakian categories C iy which in this 
case we denote by n^i- Note that, in this case and in several others, to avoid using a 
double subscript, we have dropped reference to the particular non-principal ultrafilter 
being applied; as it will always be assumed to be fixed but arbitrary, no confusion 
should result. 

The reader is encouraged to consult the index for a more complete list of commonly 
used symbols. 
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Chapter 2 



Algebraic Groups, Hopf Algebras, 
Modules, Comodules and 
Cohomology 

Here we review the basic facts concerning the duality between algebraic groups 
and their Hopf algebras, modules for algebraic groups vs. comodules for Hopf algebras, 
and the definitions concerning cohomology of modules and comodules. We shall also 
define the equivalent categories Rep k G and ComodA, where A is the representing 
Hopf algebra of G, and the important constructions within them. We shall mostly 
be content to recording definitions and important theorems, only rarely supplying 
proofs. The reader may consult [13], [I], and [TO] for a more thorough and excellent 
account of what follows. We particularly recommend the first several chapters of 
[16] for those less accustomed to the 'functorial' view of algebraic groups we shall be 
adopting. [12] and [2] are also excellent references, but with much less of an emphasis 
on this functorial view. 
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2.1 Algebraic Groups, Coalgebras and Hopf Alge- 
bras 



In this dissertation, "algebraic group" over a ring k shall always mean a particular 
kind of affine group scheme. It is at this level of generality in which we will operate 
throughout. If k is any commutative ring with identity (and all rings will assumed to 
be so) a A>algebra shall always mean a commutative fc-algebra with identity. 

Definition 2.1.1. (see section 1.2 of [16J) An affine group scheme over a com- 
mutative ring k with identity is a representable covariant functor from the category 
of all fc-algebras to the category of groups. We say it is an algebraic group if the 
representing object of the functor is finitely generated as a /c-algebra, and a finite 
group if it is finitely generated as a fc-module. 

Definition 2.1.2. (see section 1.1 of [I]) Let k be a ring, C a k- module. C is 
called a fc-coalgebra if it comes equipped with fc-linear maps A : C — > C <8> C (co- 
multiplication) and £ : C — > k (co-unit) making the following two diagrams commute: 



C 



A 



A 



1®A 



(2.1.1) 



c® c 



A®1 



c 




(2.1.2) 



c® c 
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A /c-bialgebra is a fc-module C which is simultaneously a /c-algebra and a fc-coalgebra 
in such a way that A and e are algebra maps. A bialgebra C is called a Hopf algebra 
if it comes equipped with a fc-algebra map S : C — )■ C (co-inverse or antipode) making 
the following commute: 




A® A 



(2.1.3) 



A morphism between the /c-coalgebras (C, A) and (C, A') is a fc-linear map : C — > 
C such that the following diagram commutes: 



C 



c®c 



a 



Theorem 2.1.1. (see section 1.4 of fTbJ ) The representing object of an affine group 
scheme over k is a Hopf algebra over k. Conversely, any Hopf algebra over k defines 
an affine group scheme over k. 

Let G( ) = Homfc(A, ) be an affine group scheme over k represented by the 

Hopf algebra A. As the names suggest, the co-multiplication, co-unit, and co-inverse 
maps attached to a Hopf algebra encode the group multiplication, identity, and in- 
version, respectively. If R is a /c-algebra then an element of G(R) is by definition a 
fc-homomorphism : A — > R. Then the map A tells us how to multiply elements of 
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G(R); given 0, tj) : A — > R their product, call it 4>*i[), is denned to be the unique map 
making the following diagram commute: 



A 



A® A 



R 



mult 



R®R 



Similarly the inverse of the element 0, call it inv(0), is the unique map making 



mv(<j>) 




commute. Finally, the identity element e of G(R) is defined by the commutativity of 
the diagram. 




2.1.2 



With this in mind, diagrams 2.1.1 2.1.2 and |2. 1.3 are really not mysterious at 

iplication defined by A is associative, 
k — > R is always the identity element 



all. |2. 1. 1 merely encodes the fact that the multiplication defined by A is associative 
corresponds to the statement that A - 



of G(R), and one can probably guess what fact about groups [2. 1.3 represents. 

When we refer to a Hopf algebra we shall often write it as (A, A, e), emphasizing 
the fact that these are often the only pieces of information we require for a given 
purpose. Besides, just as inverses can be discovered by looking at the multiplication 
table of a group, so also is the map S completely determined by the map A (and the 



17 



same can be said for the map e). 

Essential to the study of representable functors in any category is the so-called 
Yoneda lemma, which tells us that natural transformations from representable func- 
tors to other functors are quite easy to describe. 

Lemma 2.1.2. (Yoneda lemma; see section 1.3 of JW$) Let C be any locally small 
category (so that Horn-sets are actually sets), G,H : C — )■ Sets any two (covariant) 
set valued functors, and suppose that G is representable by the object A e C (so that 

G{ ) = Homc(A, )). Let $ be a natural transformation from the functor G to H. 

Then for any object X of C and element (j) of Homc(A,X), = {H<P)(^aO-a))- 

Proof. Given : A — y X, consider the commutative diagram 

G(A) — H(A) 



H<f> 



G ( x ) -^r> H ( x ) 

Here G<p refers to the map that sends ip : A — > A to ip o <p, and H <ft refers to whatever 
map H sends to. Start with 1^ e G(A) in the upper left corner, chase it around 
both paths to H(X), and you get the equation claimed. □ 

As the names suggest, there is a duality between fc-algebras and fc-coalgebras. 

Definition 2.1.3. (see section 1.3 of [1]) Let k be a field, (C,A,e) a /c-coalgebra. 
The dual algebra to C is the set C* of linear functionals on C endowed with the 
following multiplication: if <f), ip e C*, then mult(0 ® ip) is the /c-linear map given by 
the composition 

If A is an infinite dimensional algebra, it is generally not possible to introduce a 
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coalgebra structure on the entire dual space of A. We can however introduce one on 
a certain subspace. 

Definition 2.1.4. (see section 1.5 of [16]) Let k be a field, A a fc-algebra. The finite 
dual coalgebra of A, denoted A°, is the subspace of A* consisting of those linear 
functionals on A which kill an ideal of A of finite codimension. For a 6 we define 
A(a) as follows: let a act on A <g> A by the composition 



and then pass this composition to the isomorphism (A <g) A) ~ A° <g> A . We define 
e : A° -)> fc by e(a) = a(l). 

If A is finite dimensional, then A° is all of A*. In this case there is a natural 
isomorphism A ~ A°* of algebras, and likewise a natural isomorphism C ~ C*° of 
coalgebras. 

This duality is functorial. Given an algebra map : A — > 5 we get a coalgebra 
map 0° : 5° — )■ A° defined by, for /3 G 5°, 4>°{(3) is the composition 



In a similar fashion we get algebra maps from coalgebra maps. 

2.2 G- modules and A-comodules 

Definition 2.2.1. (see section 3.1 of [16]) Let G be an affine group scheme over 
the ring k, and V a fc-module. A linear representation of G on V is a natural 

def 

transformation from the functor G( ) to the functor GLy( ), where GLy(R) = 

Autn(V <S> R)- We say also then that V is a G- module. 
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The concept of a linear representation, put this way, is a bit intimidating. How- 
ever, it is a consequence of (the proof of) the Yoneda lemma that linear representations 
correspond to very concrete things, called comodules. 

Definition 2.2.2. (see definition 2.1.3 of [I]) Let (C,A,e) be a coalgebra over the 
ring k, V a fc-module. V is called a (right) C-comodule if it comes equipped with a 
/c-linear map p : V — > V <S> C such that the following two diagrams commute: 



V 



V ®c 



1®A 



(2.2.1) 



V ®c > V® c®c 

P®1 



v®c 




(2.2.2) 



V ® fc 

Theorem 2.2.1. (^see section 3.2 of ITSjj) If G is an affine group scheme represented 
by the Hopf algebra A, then linear representations ofG on V correspond to A-comodule 
structures on V . 

It is for this reason that, in this dissertation, we shall quite often confuse the 
notions of G-module and A-comodule, and shall sometimes speak glibly of A-modules, 
representations for A, comodules for G, etc. 

Here is the correspondence. Given an A-comodule (V,p), we get a representation 
$ of G on V as follows: if g G G(R), then g is by definition a fc-homomorphism from 
A to R. Define g to act on V <S> R via the composition 



V -^V ®A^ V®R 



(2.2.3) 
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and then extend to V <8> R by i?-linearity. Conversely, let $ : G — > GLy be a 
representation. Then we may ask how id^ G G(A) acts on V, that is, what is the 
map 

$ A (id A ) : V® A -> F (g) A 

As we demand this map to be A-linear it is necessarily determined by its restriction to 
V ~ V® 1, and it is this map, call it p, which gives V the structure of an A-comodule. 
A Yoneda lemma type argument guarantees that, for any g G G(R), &n(g) is given 



by equation 2.2.3 



Let us be more explicit. Suppose k is a field and (V, p) a finite dimensional A- 
comodule. Fix a basis e 1; . . . , e n of V and write 



&i Q9 dij 



i=l 



Then the matrix (a.y) is the 'formula' for the representation of G on V. That is, 
for g G G(R), g acts on V <g> R via the matrix (g(a,ij)) in the given basis, and then 
extending by i?-linearity. 

Comparing equations 2.1.1 and |2.1.2 with equations 2.2.1 and 2.2.2 we see that 



(A, A) itself qualifies as a (usually infinite dimensional) A-comodule, and we call it the 
regular representation. Among other reasons, this is an important representation 
because 

Theorem 2.2.2. (see section 3.5 of JWI) If (V, p) is an n- dimensional A-comodule, 
then V is embeddable in the n-fold direct sum of the regular representation. 

Over fields, we have the following elementary yet eminently useful results. 

Theorem 2.2.3. (Fundamental theorem of coalgebras; see theorem 1.4.7 of Let 
k be a field, C a k-coalgebra. Then C is the directed union of its finite dimensional 
subcoalgebras. 
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Theorem 2.2.4. (Fundamental theorem of comodules; see theorem 2.1.7 of UV) Let 

k be a field, C a k-coalgebra, V a C-comodule. Then V is the directed union of its 
finite dimensional subcomodules. 



2.3 The Categories Rep^G and Comod 



A 



Let He a field, G an affine group scheme over k, A its representing Hopf algebra. 

Definition 2.3.1. ComocU is the category whose objects are finite dimensional A- 
comodules, and whose morphisms between (V, p) and (W, fi) are those fc-linear maps 
(f) : V — > W making the following commute: 



V 



w 



V® A 



0<g) 1 



W® A 



Definition 2.3.2. Rep fc G is the category whose objects are finite dimensional vector 
spaces V with a prescribed G-module structure $ : G — >■ GLy, and whose morphisms 
between (V, $), (W, are those linear maps : V — >■ W such that, for every /c-algebra 
R and element g G G(R), the following commutes: 



V ® R^^W ® R 



V ® i? 



Note well that whenever we write ComodA or Rep fc G it consists of only finite 
dimensional modules/comodules, in contrast to the notation of some other authors. 



The correspondence between G-modules and A-comodules given in theorem 2.2.1 
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is actually a functorial one (and it would be of no use otherwise). 

Theorem 2.3.1. The categories Rep k G and ComodA are equivalent as k-linear abelian 
tensor categories. 

This theorem is merely the assertion that a linear map : V — > W is a morphism 
for V and W as G-modules if and only if it is for V and W as A-comodules, and that 
the tensor structure on Comod/i (defined below) induces the usual tensor structure 
we expect to see in Rep k G. 

Definition 2.3.3. Let A be a Hopf algebra, (V, p), (W, p) finite dimensional A- 
comodules. 

1. Their direct sum is the A-comodule with underlying vector space V © W and 
comodule map given by the composition 



V © W ^ {V © A) © (W © A) ~ (V © W) ® A 



2. Their tensor product is the A-comodule with underlying vector space V © W 
and comodule map given by the composition 



V®W ^{V®A)®{W®A) 10Twist ^ V®W®A®A 1010mult ) V ®W © A 



3. The tensor product of two morphisms <p : V — > X, ip : W — > Y is the usual 
tensor product of linear maps 4> ® :V ®W — > X ®Y 

4. The trivial A-comodule, or trivial representation, is the A-comodule hav- 
ing underlying vector space k and comodule map p : k — > k © A given by 
p : 1 !->■ 1 © 1 

5. The dual of (V, p) is the A-comodule with underlying vector space V* and 
comodule map p : V* — > V* © A defined as follows. Let p* : V* — > B.om k (V, A) 
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be the map that sends the functional : V — > k to the composition V — > 
V <8> A k £g> A ~ A. Then /i is the composition 

V* A Rom k {V, A)~V*®A^>V*®A 

where S denotes the antipode of A 

6. The internal Horn of V and W, denoted Horn (I 7 , W), is the tensor product of 
the dual of V with W. 

An alternative, basis dependent definition of the dual (V*,p) of the comodule 
(V, p) is as follows. Pick a basis {ei, . . . , e n } of V, and let {cti, . . . , a n } be the dual 
basis of V*. Write p : ej !->■ X]j e « ® a u'5 an d set p : ctj (-)■ J^jOii ® a,j (note the 
transpose being applied). Then p is the composition 

V* -^V*®A ^ V* ® A 

In other words, if (a^) is the matrix formula for a representation on V in a given 
basis, then the dual representation on the dual space V*, in the dual basis, is the 
inverse of the transpose of (ay). 

2.4 Cohomology of Comodules 

The relevant definitions for cohomology in Comod^ make sense in any fc-linear 
abelian category, so we state them at this level of generality. The reader may consult 
[T7] or [T] for a more thorough introduction to these matters. 

Definition 2.4.1. Let C be a fc-linear abelian category, M, N objects of C. An n-fold 
extension of M by iV is an exact sequence 
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f : ->■ N ->• X n _x X —)> M —)> 



Two extensions are equivalent if there exist morphisms fa : X t — > Yj, such that 



N 



X n -i 







-> N 



n-1 




commutes. The set of equivalence classes (with respect to the equivalence relation 
generated by the relation 'being equivalent') of n-fold extensions of M by N is denoted 
Ext n (M,N). 

In case n > 1, the term 'equivalent' is abusive; it is a not necessarily symmetric or 
transitive relation. Nonetheless, this relation generates a unique equivalence relation, 
and it is this relation with respect to which Ext n (M, N) is defined. 

On the other hand, this relation is a bona fide equivalence relation on Ext 1 (M, N). 
This is because, as can be shown, the map O given by the definition of equivalence 
must necessarily be an isomorphism. 

Let 

f : ->■ N ->• X n _i ->• . . . ->■ X ->• M ->■ 

X ■ ->■ X ->■ F n _i io —> M —> 

be two n-fold extensions of M by iV with n > 1. The Baer sum of £ and x, denoted 
£ © X; is the extension gotten as follows. Let T and f2 be the pullback/pushout 
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respectively of the following two diagrams 



Y 



r 

A 



Y 



n-l 

A 



X > M X„_! < iV 

Then £ © x is the extension 

o ->■ n ->■ r ->■ x n _ 2 © ->...->Xi©yi->n->M->o 

For n = 1, the Baer sum is defined slightly differently. Let 

$:04JvAliAM40 

be two 1-fold extensions of M by iV. Let X be the pullback of Xi and X 2 under M, 
and 0', the unique maps making 



N N 
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commute. Let X — H> Y be the cokernel of 0', and ip the unique map making 

N ^ > M 

1 




Y 

commute. Set = 0r. Then the Baer sum £ © x is the extension 

^ N ^Y M ^ 

Let a 7^ be a scalar, £ an n-fold extension as above, and let M X n _! be the 
first map in the extension. Then we define the scalar multiplication of a on £ to 
be the extension 

of : ->■ M ^ X n _i ->■ . . . ->■ X ->■ N ->■ 

with all of the other maps and objects staying the same. For a = 0, we define 0£ to 
be the trivial extension (defined below). 

Theorem 2.4.1. For any two objects M and N of a k-linear abelian category, Baer 
sum and scalar multiplication respect equivalence classes, and Ext n (M, N) is a vector 
space under those operations. 

The additive identity of Ext n (M, N) is called the trivial or split extension. In 
the case of Ext 1 , it can be identified as the equivalence class of the extension — > 
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Chapter 3 



Tannakian Duality 



Here we present the basic facts concerning the correspondence between categories 
of finite dimensional representations of affine group schemes over a field, and so-called 
neutral tannakian categories. We have no intention of giving a full account, especially 
concerning proofs; we shall mostly content ourselves with giving the definition of a 
neutral tannakian category, and, following the proof given in [5] , describing a method 
for recovering the representing Hopf algebra of such a category. The reader may 
consult [15] or [3], but the development given here follows almost exclusively that of 
[5]. The reader may also consult [7] for an excellent introduction to the theory of 
abelian categories, and pQ] for a good account of tensor categories in the abstract 
(there referred to as monoidal categories, but without many of the assumptions we 
shall be placing on them). 

The theory of tannakian categories, while having broader implications than what 
we will be discussing is, as far as we are concerned, a successful attempt to answer 
three natural questions about the category Rep fc G (equivalently, ComodA, where A 
is the representing Hopf algebra of G). Firstly, to what extent does purely categor- 
ical information about the category Rep fc G determine the group G? The answer is, 
completely, if one allows for one piece of external information (called a fibre functor). 
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Secondly, can we recover in some constructive way the Hopf algebra A from ComocU? 
The answer again is yes, and it is to this that we will devoting most of our time. Fi- 
nally, is there a set of axioms one can write down which are equivalent to a category 
being Rep fc G for some k and G? The answer again is yes, and these axioms serve as 
the definition for a neutral tannakian category. 

3.1 A Motivating Example 

To motivate the definitions given in the next section it will help to keep in mind 
the simplest and yet most important example, the category Vecfc consisting of all 
finite dimensional vector spaces over a field k, with morphisms being /c-linear maps 
between the vector spaces (or, if you like, Rep fc Go, where Go is the trivial group 
represented by the Hopf algebra k). Firstly, Vec^ is a /c-linear abelian category. This 
means, among other things, that the Hom-sets themselves have a A;-linear structure 
on them, and composition of morphisms is bilinear with respect to this structure. 
It also means that Vec^ satisfies some nice regularity conditions, and that certain 
desirable constructions are always possible within it: finite biproducts always exist 
(in the form of the usual direct sum of vector spaces), kernels and cokernels always 
exist, and all monomorphisms and epimorphisms are normal (every injective map is 
the kernel of its cokernel, and every surjective map is the cokernel of its kernel). 

As it happens, this is not quite enough to recover fully the fact that Vecfc is indeed 
Vecfc. Enter the tensor product. To every pair of vector spaces V and W we assign an 
object called V ® W, and to every pair of morphisms V A- X and W Y we assign 
a morphism, denoted V <8> W X <g> Y . We also have that, for every composable 
pair 0, if) and composable pair a, b, ((f) <S> a) o (if) <g) b) = ((f) o iji) <g) (a o b). This amounts 
to the assertion that <S> is a bifunctor on Vec fc . This bifunctor <g> is a bilinear functor 
in the sense that, for any c G k, (c(f) + if>) <S>r) — c(<p (g> rj) + (tp ® rj), and similarly for 



29 



the other slot. 

We know also that <g> is a commutative operation. That is, for every pair of vector 

commas 

spaces A and B there is a natural isomorphism A (g) B ~ 5 (g) A (namely the 
map a <g> ft (->■ ft (g) a) . Naturality here means that for every pair of maps A A- X and 
B — ?> y the following commutes: 



A® 5 ^> B® A 



commx Y 



Similarly, <g is naturally associative, given by the natural isomorphism (A (g> B) <g> 

assoc^ b q 

C ~' A g) (B <g> C) (namely (a (g) 6) <g> c >->■ a <g> (6 <g> c)). 

Vecfc also has an identity object for <g>, namely the vector space k. This means 
that to every vector space V, there is a natural isomorphism V ~ v k g) V (namely 
v i — y 1 eg v^j , satisfying diagrams analogous to the above. In the context of abstract 
tannakian categories this identity object is denoted as 1. 

We also mention that the isomorphisms comm and assoc satisfy some coherence 
conditions with one another. These are expressed by the so-called pentagon and 
hexagon axioms, to be discussed in the next section. 

Recall the universal bilinear mapping property of the tensor product. For every 
V and W there is a bilinear map ® \V xW — > V <g>W with the following property: 
to every bilinear map V x W A- Z there is a unique linear map V ®W -4 Z such 
that the following diagram commutes: 




VxW 
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What this gives us is a natural isomorphism between linear maps on V <8> W and 
bilinear maps onVx W. But a bilinear map V x W — )■ Z is just another name for a 
linear map V — > Hom(W, Z). Thus we have an isomorphism 

Hom(V <g> W, Z) ~ Hom(V, Hom(W / ", Z)) 

Another way of stating this is that Hom( ® W 7 , Z) is a representable functor, and 

its representing object is HomfW, Z). Vec fc enjoys the property that, for any objects 
V and W, Hom(V, W) is also an object of Vec fe . For a category in which this is not 
exactly the case, we have a different name, when it exists, for the representing object 

of Hom( <g) W, Z): we call it internal Horn, and denote it by Hom (W, Z). The above 

discussion can thus be summed up as saying that, in Vec^, internal Horns always 
exist. 

Of special interest then is, for any vector space V, the object Hom(V, 1), which in 
Vecfc can be identified as the space of linear functionals on V. We denote this object 
by V y . It is well known that any vector space V is naturally isomorphic to V yy via 
the map v >->■ ev„, where ev„ is the map that evaluates any functional V — >■ k at the 
element v. We say then that all objects of Vecfc are reflexive in the sense that v ev v 
is always an isomorphism. 

For any vector spaces X 1: X 2 ,Y 1: Y 2 , there is an obvious isomorphism Hom(X 1 , Y"i)<8) 
Hom(X 2 , Y 2 ) ~ Hom(X 1 (g)X 2 , Yi®Y 2 ), namely the map that sends the element 4>®il) 
to the map of the same name. This isomorphism can be thought of as expressing the 
fact that the tensor product, acting on Hom (X, Y) as an object, is compatible with 
its action on it as a Hom-set. 

In Vecfc, End(l) = End(A;) can be identified with the field k itself, if we take 
addition to be addition of maps and multiplication to be composition of morphisms. 
Thus we can say that End(l) is a field. With this in mind, and given everything else 
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we've done, we can actually define the /c-linear structure on Vecfc without assuming 
it. If c : k — > k is the linear map given by multiplication by the constant c G k and 
4> : V — > W any linear map, we can define c0 as the composition 

Finally, there is a so-called fibre functor on Vec^, i.e. an exact, k-linear tensor 
preserving functor from Vec fc to VecEnd(i)- Here, we can simply take this functor to 
be the identity Vec fc — > Vec fc . 

The preceding discussion amounts to the assertion that Vec fc is a neutral tannakian 
category, which we formally define now. 

3.2 Definition of a Neutral Tannakian Category 

Definition 3.2.1. An abelian category is a category C with the following proper- 
ties: 

1. For all objects A,BeC, Rom(A, B) is endowed with the structure of an abelian 
group, and composition of morphisms is bilinear with respect to this structure 

2. Every pair of objects in C has a biproduct 

3. Every morphism has a kernel and a cokernel 

4. Every monomorphism is the kernel of some morphism, and every epimorphism 
is the cokernel of some morphism 

Definition 3.2.2. Let C be a category endowed with a bifunctor (g) : C x C — > C, 
and denote by, for objects A, B and morphisms (f>,ip, A ® B = f <g>(A, B) and <g> ip = f 
<g>(0, ip). Then <g> is called a tensor product, and C is called a tensor category, if 
the following hold: 
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1. There is a functorial isomorphism assocx,y,z X <g> (Y <g> Z) ~ (X <g> Y) <g> Z 

2. There is a functorial isomorphism commx,y : X ® F ~ F <8> X 

3. There is an identity object, denoted 1, and a functorial isomorphism unity 
V ~ 1 (8) V inducing an equivalence of categories C — > C 

4. (pentagon axiom) For all objects X, F, Z and T, the following commutes: 

(X <g> F) <g> (Z <g> T) 



X <g) (F <g> (Z <g> T)) 



((X®F) <g> Z) ®T 



X<g)((F ®Z)®T) 



{X ®{Y ®Z))®T 



(the obvious sub-scripts on assoc have been omitted) 



5. (hexagon axiom) For all objects X, Y and Z, the following commutes: 



(X <g> F) <g> Z 




X <g) (F <g) Z) 



X® (Z<g)F) 



Z <g> (X <g> F) 



(Z®X) ®F 




commcgil 



(X <g> Z) <g> F 



33 



6. For all objects X and Y, the following commute: 



unit 



X <8> Y > 1®(X®Y) 



unit®l 

x ® y > (l ® x) ® y 



unitcgil 



l®unit 



comm® 1 



(i ® x) g> y 



x ® (l ® y) {x ® i) ® y 



For reasons of convenience our definition of a tensor category is a slight deviation 
from that given on page 105 of [5]. There conditions 3. and 6. above are replaced 
with the seemingly weaker requirement that there exist an identity object U and 
isomorphism u : U — )■ U ®U such that X h-> £/ g) X is an equivalence of categories. 
However, proposition 1.3 on that same page makes it clear that Deligne's definition 
implies ours, so we have not changed anything. 

What we call a tensor category others might call a monoidal category, and our 
demand that it be, e.g., naturally commutative is quite often not assumed by other 
authors. Saavedra in would in fact call this an ACU tensor category, indicating 
that is associative, commutative, and unital. We shall have no occasion to consider 
any tensor categories but this kind, so we call them simply tensor categories. 

The significance of the pentagon and hexagon axioms is that, loosely speaking, 
they introduce enough constraints to ensure that any diagram that should commute, 
does commute. The reader should see jH] and [5] for more on this. 

We also note that the identity object 1 and the isomorphism unit are not demanded 
to be unique. However, any two such are isomorphic up to a unique isomorphism 
commuting with the unit isomorphisms, so it is unique for all intents and purposes; 
see proposition 1.3 of [5]. 

We define an abelian tensor category to be an abelian category equipped with 
a tensor product in such a way that <g> is a bi-additive functor, i.e. (0 + ip) ® a = 
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<g> a + ip <8> a, and similarly for the other slot. 

Our goal now is to define what is called a rigid abelian tensor category. This is 
defined to be an abelian tensor category in which all internal Homs exist, every object 
is reflexive, and for all objects Xi, X 2 , Yi, Y 2 , a certain natural map 

Hompfi, Yx) ® Hom(X 2 , Y 2 ) ->• HomfX <g> X 2 , ^ ® y 2 ) 

is always an isomorphism. We define now what these things mean in purely categorical 
terms. 

Let X and Y be objects of the tensor category C and consider the contravariant 

functor Hom( £g> X, Y) from C to the category of sets. It sends any object T to 

Hom(T £g> X, Y) and any morphism T — >■ to the morphism Hom(W / (g) X, Y) 
Hom(Tcg)X, y) defined by, for W <&X — > Y, the image of ip under is the composition 



T ®X^\W®X^Y 



Suppose that, for fixed X and Y, this functor is representable, and call the repre- 
senting object Hom (X, Y). Hom (X, Y) is by definition an internal Horn object 
for X and Y (so called because, e.g. in Vec/c, internal Horn is just Horn). Then 

we have a natural isomorphism Q going from the functor Hom( , Hom (X, Y)) to 

the functor Hom( <g> X, Y). If we plug in the object Hom(X, Y) to each slot and 

apply ^ Hom (x.y) to the element id G Hom( Hom (X, Y), Hom (X, Y)), we get a map 
Hom(X, Y) <g) X — > Y, which by definition we call evx,Y (so called because, in Vec^, 
evx,y is the evaluation map <f> £g> x > <f>(x)). 

The significance of the map evxY is that, analyzing the situation in light of the 



Yoneda lemma (page 18), we find that for any morphism <p in Hom(T, Hom (X, Y)), 
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applying the isomorphism Q T yields 



Q T ((f>) = (0 ® 1) o ev x ,Y 

which is to say, the following is always commutative: 

T®X 




We suppose now that for any objects X and Y, Hom (X, Y) exists. We define 
the dual of X, denoted X v , to be Hom (X, 1), where 1 is the identity object for our 
tensor category, and we simply write ev x for evx,i, which is a map X y ® X — >■ 1. In 
Vecfc, evx is the familiar map <fi <8> x (-> <f>(x). 

We now proceed to define a map u x ■ X — > X vv , which in Vecfc will correspond to 
the usual evaluation map x >->■ ((f) (->■ (f)(x)). We have, for any X, an isomorphism fix 
between 

Hom(X, X vv ) ^ Hom(X ® X v , 1) 

Define t x '■ X — > X vv to be the map on the left hand side of the above isomorphism 
which corresponds to the composition X (g) X v C -^T X v ® X — ^ 1 on the right hand 
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side. In other words, l x is the unique map making the following commute: 




(3.2.1) 



X vv <g) X v > 1 



cv x v 

We define an object X of C to be reflexive if the map tx just constructed is an 
isomorphism. 

Consider the following composition: 

(RomiXu Y 1 ) ® Hqm(X 2 , Y 2 )) ® {X 1 ® X 2 ) ^ (3.2.2) 
(HompT!, Y x ) ® Xi) ® (Hom(X 2 , F 2 ) ® X 2 )) Yi ® Y~ 2 

where the first isomorphism is the obvious one built by application of the comm and 
assoc isomorphisms. Call the above composition ^. Then there is a unique map, call 
it $, making the following commute: 

(Hompfi, Y x ) ® Hom(X 2 , Y 2 )) ® {X x ® X 2 ) 




(3.2.3) 



Hom(Xi ® X 2 , Fi ® y 2 ) ® (Xi ® X 2 ) ^> Y x ® Y~ 2 

Definition 3.2.3. An abelian tensor category is called rigid if Hom (X, Y) exists for 
all X and Y, all objects are reflexive, and for any quadruple Xi, X 2 , Yi, F 2 , the map 
$ just constructed is an isomorphism. 
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Next consider the Horn-set End(l). As composition is demanded to be bilinear 
with respect to the additive structure on Horn-sets, End(l) is a ring with unity under 
addition and composition. We say then 

Definition 3.2.4. A rigid abelian tensor category is called a tannakian category 
if the ring End(l) is a field. 

Assume now that C is tannakian and let k be the field End(l). Then C has a k- 
linear structure forced upon it as follows. If c is an element of k (that is, a morphism 
1 4 1) and is a morphism from V to W , then we define the scalar multiplication 
c0 to be the composition 

As <S> acts bilinearly on morphisms, we have c(0 + ip) = aj) + cip for all c G End(l), 
0, ip e Hom(V, W). Thus, Hom(V, W) is a vector space over k. 

If C is any category, then being tannakian is not quite enough for us to conclude 
that it is the category of representations of some affine group scheme over the field 
End(l). We need the additional fact that objects of C can, in some sense, be 'thought 
of as concrete finite dimensional vector spaces, and morphisms as concrete linear 
maps between them. This is the role fulfilled by a fibre functor, a certain kind of 
functor from C to Vec^, where k is the field End(l). 

We need to define first what is meant by a tensor functor F : C — >■ V, where C 
and V are tensor categories. We denote with the same symbol <g> the tensor product 
in both categories. We denote by assoc the requisite associativity isomorphism in C, 
and by assoc' that in V; similarly for the natural isomorphisms comm, unit, and the 
identity object 1. 

Definition 3.2.5. Let C and V be tensor categories, and F : C — > V a functor. F is a 
tensor functor if there is a functorial isomorphism c x ,y '■ F(X)<g>F(Y) — =->■ F(X®Y) 
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satisfying 



1. For all objects X, Y, Z of C, the following commutes: 



F{X) ® F(Y ® ^) 




F(X) ® ® F(Z)) 



(F(X)®F(Y))®F(Z) 



F(X® (Y g> Z)) 



F(assoc) 



f((x® y) ® z) 




F(X® Y) <g>F(Z) 



2. For all objects X and Y of C, the following commutes: 



F(X) ® F(Y) F(X <g> Y) 



F(comm) 



f(y) <g> F(x) — > F(y <g> x) 



3. If (1, unit) is an identity object of C, then there is an identity object (1', unit') of 
V such that F(V) = 1', and for every object X of C, \ua.t' F , x \ = F(unitx) ° cx,i- 

Again, condition 3. appears stronger than condition (c) on page 114 of [5], but 
they are actually equivalent, again by proposition 1.3 of [5]. 

If C is any tannakian category over the field k = End(l) then we define a fibre 
functor on C, usually denoted u, to be any exact, faithful, ^-linear tensor functor 
from C to Vecfc. We can finally define 
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Definition 3.2.6. A neutral tannakian category is a tannakian category equipped 
with a fibre functor. 



3.3 Recovering an Algebraic Group from a Neu- 
tral Tannakian Category 

Let G be an affine group scheme over the field k and let u : R,ep k G — > Vec^ be 
the forgetful functor, i.e. the functor which sends every representation of G to its 
underlying /c-vector space, and every map to itself. If R is a /c-algebra, we define 
Aut®(a>)(i?) to be the collection of tensor preserving automorphisms of the functor 
oj r : Rep fe G — ?■ Mod/j. Here u R is the functor which sends any object X of Rep fc G to 
the i?-module X®R (strictly speaking we should write u{X)®R, but we deliberately 
confuse X with its underlying vector space oj(X) to keep the notation simple), and 
sends any morphism X -4 F to the i?-linear map X ® R ^\ Y ® R. To be more 
explicit 

Definition 3.3.1. An element of Aut®(u))(R) is a family (Ax : X e Rep fe G), where 
each Ax is an i?-linear automorphism of X <g> _R, subject to 

1. Ai is the identity map on R ~ fc <g> i? 

2. Ax®y = Ax ® Ay for all X, F e Rep fc G 

3. For all morphisms X -A Y in Rep fc G, the following commutes: 

X® i?^^X® R 



F®i?— ^>F® # 

Ay 
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If g is an element of the group G(R) then it is trivial to verify that g defines an 
element of A\it®(u)(R), just by working through the definitions. If, for X £ Rep k G, 
we write gx for the automorphism X <g> R — > X ® R defined by the representation 
G — > GL(X), then the above three requirements are all satisfied. 1 is of course the 
trivial representation in Rep fc G, so g acts identically on 1 by definition. The action 
of gx®Y on X <g> Y is defined by the equation gx®Y = gx <8> gy , giving us 2., and 3. is 
true since morphisms in Rep fc G must by definition commute with the action of g. 

Thus, we have a natural map from the functor G to the functor Aut®(w). We can 
now state one half of the principle of tannakian duality: 

Theorem 3.3.1. (proposition 2.8 of f^/) If G is an affine group scheme over the field 
k, then the natural map of functors G — > Aut®(u) is an isomorphism. 

Stated more plainly: the only tensor preserving automorphisms of the functor u R 
are ones that are given by elements of G(R). We see then that the category Rep^G, 
along with the forgetful functor u, completely determines the group G: it can be 
recovered as the affine group scheme Aut®(a;). 

This first half of our main theorem points the way to the second half. Starting 
with an abstract neutral tannakian category C with fibre functor u, the functor G = 
Aut^a;) is itself an affine group scheme such that C is tensorially equivalent to Rep fe G. 
That is: 

Theorem 3.3.2. (theorem 2.11 of [5]) Let C be a neutral tannakian category over 
the field k with fibre functor oj. Then 

1. The functor Aut®{u) on k-algebras is representable by an affine group scheme 
G 

2. oj defines an equivalence of tensor categories between C and Rep k G 

This is the principle of tannakian duality. 
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The remainder of this section is devoted to, following the proof of the above 
theorem given in [3], giving an 'algorithm' of sorts for recovering the representing 
Hopf algebra A from an abstract neutral tannakian category. We shall not justify 
most of the steps taken; the interested reader should see the actual proof for this. For 
the remainder, C will denote a fixed neutral tannakian category over the field k with 
fibre functor u. We denote the image of the object X under the functor uj as u(X), 
and the image of a morphism <p under u simply as 0. The usual names for 1, comm, 
assoc, <g>, etc. also hold here. 

Definition 3.3.2. (see page 133 of [5]) For an object X of C, (X), the principal 

subcategory generated by X, is the full subcategory of C consisting of those objects 
which are isomorphic to a subobject of a quotient of X n = X © ... © X for some n. 

Note firstly that (X) is not itself a tannakian category, in general not being closed 
under the tensor product; it is however a fc-linear abelian category. Note that Y G (X) 
if and only if (Y) C (X). We can say then that C is the direct limit of its principal 
subcategories, with the direct system being the inclusions (Y) C (X) when applicable. 

Definition 3.3.3. (see lemma 2.13 of [5]) For X an object of C, we define End(o;|(X)), 
the collection of all endomorphisms of the fibre functor u restricted to (X), to 
consist of families A = (Ay : Y G (X)) such that Ay : uj(Y) — > uj(Y) is a fc-linear 
map, and for every C-morphism Y A Z, the following commutes: 

uj{Y) — -^—^ u(Y) 



■ 



u(Z) — - — >u(Z) 

An important point, used often in this dissertation, is the fact that every A G 



42 



End(w|(X)) is determined by Ax- If k '■ X — > X n is the i th inclusion map then 



Ax 



u(X) 



u{X n ) 



must commute for every i, which clearly forces Ax™ = A x . If Y is a subobject of some 
X n with Y X n injective, then Ay must commute with 



u{Y) 



c > 



u{X" 



u{Y) 



u(X n ) 



and since i is injective, Ay is unique in this respect. Finally, if Z is a quotient of some 
Y G (X) with Y a subobject of X n , then we have a surjective map Y — ^ Z and the 
commutative diagram 

w(ker(vr)) ^!_> u>(Y) — ^-» uj{Z) 



a 



kcr(7r) 



cu(ker(7r)) c —> u(Y) 



-» 



u(Z) 



with i the inclusion of the kernel of ir into F. By commutativity of the left square 
Ay must stabilize w(ker(7r)). This shows that there is at most one X z making this 
diagram commute, hence Xz is determined by Ax as well. 

Therefore it does no harm to confuse End(o;|(X)) with {Ax : A G End(w| (X))}, 
its image in End(u(X)); we refer to this subalgebra of End(cu(X)) as L X - 

Now suppose that X G (Y), which is the same as saying (X) C (Y). If A G 
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End (a; I (y)) it is straightforward to check that Ax G L X - This gives, for every such 
X and F, a canonical map from Ly to Lx, denoted Txy\ we call this the transition 
mapping from L Y to L X - It is clear from the definition that, for X G (Y) and 
F G (Z), then also X G (Z), and the diagram 

Lz : — > Ly 

Tx,Y 

V 

L x 

commutes, which give the Lx,X G C the structure of an inverse system. 

For each X G C let B x be the dual coalgebra to L X - Then from the /c-algebra 

T X Y ^ X Y 

maps Ly — Lx we get A;-coalgebra maps Bx — 4- By. Thus, for objects X, Y and 
Z of C with X G (F) and F G (Z), the diagram 

-Bz < : -By 




commutes, giving the B X ,X G C the structure of a direct system. Then let 

£ = Inn 5 X 

be its direct limit; this -B is the underlying coalgebra of our eventual Hopf algebra. 

We now define an equivalence of categories F : C —t Comod B which carries the 
fibre functor u into the forgetful functor Comod B — > Vec fc , that is, such that the 
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diagram 



C > Comods 




forget 



Vec fc 



commutes. Let A be a finite dimensional fc-algebra, A° its dual coalgebra, and V a 
finite dimensional A;- vector space. We define a map Hom fc (A® V, V) Hom fc (V, V® 
A°) as follows. For p G Hom(A g) V, V), is the composition 

V ^k®V dias0M ) End fc (A°) ® 1/ ) ® A° ® V 

Id^Twist ; A o, y ^ *®Id®Id ; ^ ^ ^ ^ AO P0ld > y ^ ^ 



Reading from left to right, the various maps occurring in this composition are defined 
as: ~ is the canonical isomorphism V ~ ® V, diag is the map that sends 1 6 k to 
Id G Endjfc(A°), ~ is the canonical isomorphism Endfc(y4°) ~ A°* ® Twist is the 
obvious commutativity isomorphism, and ~ is the canonical isomorphism A°* ~ A. 

Lemma 3.3.3. Let A be a finite dimensional k-algebra, A° its dual coalgebra, and 
V a finite dimensional k-vector space. Then the map $ : Hom^A £g> V, V) — > 
HomkiV, V ® A°) just defined is a bijection. Further, p G Homk(A ® V, V) defines a 
valid A-module structure on V if and only if $(p) G Homk(V, V ® A°) defines a valid 
A° -comodule structure on V . 

The reader should see proposition 2.2.1 of [I] for a proof of this fact. However, 
be aware that our map $ is actually the inverse of the map they consider, and we 
have replaced the coalgebra C and dual algebra C* with the coalgebra A° and algebra 
A°* ~ A. 

For any X G C the vector space oj(X) is in the obvious way a module for the 
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k- algebra Lx- Then according to the previous lemma u>{X) carries with it also the 
structure of a comodule over Bx, call it px- Then if Bx — ^> B is the canonical map 
given by the definition of B as a direct limit, we get a -B-comodule structure on u(X), 
call it p, via the composition 

p : u(X) ^ u(X) ®B x l ^H uj(X) ® 5 

If X G (y) for some Y, then similarly cu(X) is a module over Ly (via the transition 
mapping Ly -^4 Lx), hence a comodule over By, and yet again over B. The various 
commutativities of the relevant diagrams ensure that we will get the same .B-comodule 
structure on u(X) no matter which principal subcategory we consider it to be an 
object of. We therefore define the image of the object X under the functor F to be 

F(X) = (u(X),p) 

That is, the -B-comodule with underlying vector space u(X) and comodule map 
p : u)(X) -> u)(X) <g> B just defined. 

It is tedious but straightforward to argue that, if X — — )■ Y is a morphism in the 
category C, then working through the definitions of Lx, Bx and B, (the image under 
the fibre functor of) is actually a map of 5-comodules. We therefore define the 
image of a morphism under F to be the same map between the vector spaces oj{X) 
and oj(Y). 

Theorem 3.3.4. The functor F : C — >■ Comods just defined is an equivalence of 
categories. 

That F is a faithful functor is clear from the fact that u is as well. The claim 
that F is both full and essentially surjective is however by no means obvious; see 
proposition 2.14 of [3] for a proof of this. 
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We have thus far recovered a A;-coalgebra B and an equivalence between our 
abstract neutral tannakian category C and Comods- What remains is to recover 
the multiplication on B. As the usual tensor product on comodules over a Hopf 
algebra is defined in terms of its multiplication, it is not surprising that we should 
turn the process around to recover the multiplication from the tensor product. 

Let B be a fc-coalgebra and u : B ®fc B — >■ B be any fc-homomorphism. Then we 
can define a bifunctor <p u : Comods x Comods — > Comod# as follows: it sends the 
pair of comodules (X, p), (F,/x) to the co module <fi u (X, Y) having underlying vector 
space X £g>fc Y and comodule map given by the composition 



X®Y ^X®B®Y®B 10Twist01 ) X®Y®B®B l ^X®Y®B 



(In case B is a Hopf algebra and u is mult, then this is by definition the tensor product 
on Comods.) What is not quite obvious is that in fact all such bifunctors arise in 
this fashion. 

Proposition 3.3.5. (see proposition 2.16 of f^j) For any k-coalgebra B, the map 
u i — y cj) u defines a bijective correspondence between the set of all k-homomorphisms 
u : B <S) B —7- B, and the set of all bifunctors F : Comods x Comods — > ComodB 
having the property that the underlying vector space of F(X,Y) is the tensor product 
of the underlying vector spaces of X and Y . 

So then, let us define a bifunctor on Comod^ which for the moment we call □. F 
is an equivalence, so it has an 'inverse' functor, call it F~ l . Then for two S-comodules 
S and T, we set 

SUT = F(F-\S)®F-\T)) 
where ® on the right refers to the given tensor structure on C. We define □ to act 
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on morphisms in an analogous fashion. As the diagram 

F 

C > Comods 

forget 

V 

Vec fc 

commutes, it is easy to see that □ as a bifunctor satisfies the hypothesis of the previous 
proposition; hence, □ is uniquely of the form n for some u : B <8> B — > B. This u, 
call it now mult, is the recovered multiplication on B, finally giving B the structure 
of a Hopf algebra. 

We close by mentioning that the necessary conditions needed for B to be a com- 
mutative Hopf algebra follow from certain properties assumed about ® on C. For 
instance, it is the existence of the natural isomorphisms comm and assoc which guar- 
antees that mult should be a commutative and associative operation, and the existence 
of an identity element for mult follows from the existence of the identity object 1 for 
g). The interested reader should see pg. 137 of [5] for more on this. 

3.4 Recovering a Hopf Algebra in Practice 

Here we record some results which will later be useful in computing the repre- 
senting Hopf algebra for a given neutral tannakian category, according to the method 
outlined in the previous section. 

3.4.1 A Categorical Lemma 

Much of the work done in this dissertation entails the computing of direct /inverse 
limits over very large and unwieldy collections of objects. The following lemma will 
allow us at times to drastically simplify our computations. 
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If (J, <) is a directed set, we say that ( J, <) is a sub-directed set if J is a subset 
of J, J is directed, and whenever j < k in J, j < k in I. Note that we do not demand 
the converse to hold; in case it does, we call J full. We say the sub-directed set J is 
essential in / if, for every i £ I, there is a j G J such that i < j in /. 

Lemma 3.4.1. Let C be any category, I a directed set, {Xi} a collection of objects 
indexed over I, and {Xj Xj} a direct system for the X { over I, and let 



be the direct limit of this system. Let J <Z I be a (not necessarily full) sub-directed 
set, and let 



be the direct limit over J . Then if J is essential in I , these two direct limits are 
isomorphic, via a unique isomorphism commuting with the canonical injections. 

Proof. It is well known that any two direct limits for the same system are isomorphic 
in the above mentioned way. Thus, we will prove the theorem by showing that the 
Xj, the direct limit over J, can also be made into a direct limit object for the Xi 
over all of /. For any i G /, define a map Xi A Xj as ipi if i G J, and in case % ^ J, 
as the composition 




Xj 



Xi 




x % 




Xj 
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where j is any member of J such that i < j. This is well-defined: if k G J is any other 
such that i < k, let / be an upper bound for j and k in J. Then every sub-triangle of 
the diagram 



Xj 




X 

commutes, and thus so does the outermost diamond. 

We claim that with these Xj A Xj, Xj is a direct limit for the Xj over all of /. 
Let Y be any object with morphisms Xj Y such that, for every % < j e /, the 
following commutes: 




Then this diagram obviously commutes for every i < j e J, and the universal property 
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of Xj guarantees a unique map Xj Y making 



Y 




commute for every j, k G J. But this map t also satisfies the universal property 
required for Xj to be a direct limit over all of I. For if i, I e / with i <l, then let 
j,k G J with i < j , I < k, and j < k, and the following also commutes: 



Y 
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and hence so does 



Y 




This map t is still unique, since satisfying universality over all of / is clearly a more 
stringent requirement than doing so over all of J. 

□ 

There is an obvious analogue to this lemma as concerns inverse limits which we 
state but do not prove. 

Lemma 3.4.2. Let C be any category, I a directed set, {Xi} a collection of objects 
indexed over I, and {Xi Xj} an inverse system for the Xi over I, and let 



Xi^ 



X, 



be the inverse limit of this system. Let J C I be a (not necessarily full) sub-directed 
set, and let 



X, 





Xi 



x 3 
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be the inverse limit over J. Then if J is essential in I, these two inverse limits are 
isomorphic, via a unique isomorphism commuting with the canonical projections. 



3.4.2 Computing End(u\(X)) 



Recall from page 43 that if X is an object of C, we define Lx to be the subalgebra 
of EncU(co>(X)) consisting of those linear maps which are 'starting points' for a full 
endomorphism of the fibre functor restricted to (X). Here we describe a practical 
method for computing Lx, which is gleaned from pages 132, 133 of [5]. The definition 
of Lx given, a priori, seems to require that we look at arbitrarily large powers of X 
to discover if a given transformation of oj(X) is or is not in Lx, but the method 
described here shows that we need only look inside a fixed power of X (X dim ^ x » in 
fact). 

Let n = dim(o;(X)) and write X n = X\ @X 2 © . . - ®X ni where each Xi is simply a 
labelled copy of X. If Y — > X n is any embedding then we can write = ipi® . . .@ip n , 
where Y ^ > Xi is the i th component of ip. 

As oj(X) is n-dimensional, so is u(X)*, so fix an isomorphism a : k n — > u(X)*, 
and let e\ = (1, 0, . . . , 0), . . . , e n = (0, . . . , 0, 1) be the standard basis of k n . From this 
ip and a we define a linear map ip a : uj(Y) — > u(X)* <S> to(X) as follows: for a vector 

y e u{Y), 

n 

i=i 

If we identify uj(X)* <Z>u(X) with Endfe(o;(X)) in the usual fashion, we may speak of 
whether the image of ip a does or does not contain the element id : uj(X) — > u(X). 
Further, exactness and faithfulness of the functor u imply that, just as in Vec^, the 
concept of a "smallest" object having a given property make sense. So we define 

Definition 3.4.1. For an object X and fixed isomorphism a : k n — > u(X)*, P x is 
the smallest subobject of X n having the property that the image of oj{P x ) under ip a 
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contains id : uj(X) — > u (X), where ip is the embedding P% — > X n . 

It is a completely non-obvious fact that 

Theorem 3.4.3. (see lemmas 2.12 and 2.13 of JE/) For any a, the image of co(Px) 
under ip a is the algebra L X - 

We have no intention of justifying this, although we do mention the reason that 
a can be chosen arbitrarily. Consider the subobject X n itself of X n , with the em- 
bedding being ip = id. Certainly if one chooses a different (3 then the subobjects 
Fx and P x will be different, but their images under ip a and ipp respectively will not 
change. Obviously ip a and ipp are isomorphisms of vector spaces, and thus we have a 
commutative diagram 



u(X n ) ^> u(X n ) 

Ipa 

V 

uo{X) y ®u{X) 

where <f>p }0l is a linear isomorphism. But it can in fact be shown 0^ jQ must in fact be 
(the image of) an actual isomorphism between the object X n and itself in the original 
category. Such a (j)p t0l must then preserve the notion 'smallest subobject', and so the 
computation will always yield the same subspace L x of End(u;(X)). 

Example: consider the following module X for the additive group G a over a field 
k, with matrix formula 

1 x 

V° 1 / 

in the basis /i,/2 for u(X). We will compute End(a;|(X)) using the method outlined 
above. This is a 2-dimensional module, so we consider the module X 2 , with matrix 
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formula 

tlx \ 

1 

1 x 

V l ) 

in the basis A,i' A,2, f2,i, A,2 for u(X 2 ). Next we consider an arbitrary subobject of 
X 2 . As any subobject factors through the identity mapping X 2 — > X 2 , it does no 
harm to choose ip = 1. Then the coordinate maps ipi,ip2 '■ ^(X 2 ) — > u>(X), using the 
usual canonical injections X — > X 2 , are 

4>1 ■ fll ^ fl, fl2 / 2 , /21 ^ 0, / 22 !->■ 

^2 : /11 >->■ 0, /12 H> 0, /21 1— >■ A, / 22 / 2 

For the isomorphism a : k 2 — > u(X)*, let's keep life simple and choose a : (1,0) 1— >■ 
/*,(0, 1) !->■ / 2 *, where /*,/| is the dual basis for uu(X)*. From this ip and a we 
compute ip a : w(X 2 ) — > uj(X)* ® w(X), which has formula 

^«(x) = a((l, 0)) <g> ^i(x) + a((0, 1)) <g> ^(z) 

and thus 

i> a -hi >->■ A ® /1 
/12 ^ A* ® A 
/21 ^ A* ® A 
A2 >-> A* ® A 

Now, to compute , we ask: what is the smallest subobject of X 2 such that, 
under the map ip a , contains the identity map X — > XI We identify the identity map 
of course as the element A* ® A + A ® A e °°(X)* <g> uo(X). This projects back 
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to, under ip a , the element fn + / 22 of u(X 2 ), which, in the given bases, corresponds 
to the vector (1,0,0,1). A quick computation shows that the smallest subspace of 
w(X) 2 stable under X and containing / n + / 22 is 



span(/ n + / 22 , /21) 



which, under ip a and then the isomorphism oj(X)* ® w(X) ~ Endfc(cu(X)), maps to 
the span of the transformations 



'1 o^ 



'0 ^ 



where we have written these transformations as matrices in the bases /1, / 2 for a;(X). 
Thus, End (a; I (X)) can be identified with the algebra of all 2 x 2 matrices of the form 



a 



y a j 



for arbitrary a and b. 

Another Example: Consider the module 



x 



x 



for the multiplicative group G rn . Skipping all the mumbo-jumbo with ip and a, all 
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we really have to do is find the invariant subspace of 



x 



\ 



x 



X 



X 



X 



X 



X 



X 



\ 



X 



J 



generated by the vector (1,0,0,0,1,0,0,0,1). This we compute to be the span of the 
vectors (1, 0, 0, 0, 0, 0, 0, 0, 0), (0, 0, 0, 0, 1, 0, 0, 0, 0), and (0, 0, 0, 0, 0, 0, 0, 0, 1). These 
in turn project to the span of the matrices 



/ 



V 



1 





\ 



/ 



V 




1 




^ ( o o o x 



/ 




1 



That is, we can identify End(u;| (X)) with the collection of all diagonal transformations 
on lu(X). 
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Chapter 4 

First-Order Definability of 
Tannakian Categories 

The goal of this chapter is to prove that, in a certain appropriately chosen lan- 
guage, the sentence "is a tannakian category" is first-order. Note that we do not 
claim that the property of 'being neutral' is necessarily first-order. 

4.1 The Language of Abelian Tensor Categories 

The title of this section is a misnomer for two reasons. First, the article "the" 
implies there is only one such language, and this is certainly not the case. It is 
however, as far as the author can tell, the most natural and minimal choice for our 
purposes. Secondly, not all structures in the 'language of abelian tensor categories' 
are abelian tensor categories, and we may as well have called it the 'language of 
tannakian categories'. But the name seems natural enough. 

Our language is purely relational; it has no function or constant symbols. The 
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primitive symbols are 



_ G Mor _ G Ob _ ° _ = _ _ + _ = _ 
<g> = assoc j > = comm t = unit = 

The intended interpretation of the symbols are as follows, x G Mor expresses that x 
is a morphism in the category, x G Ob that x is an object, x : y — » z expresses that 
the morphism x points from the object ytoz,xoy = z expresses that the morphism 
x composed with y is equal to z, and x + y = z expresses that the morphism x added 
to y is equal to z. x ®y = z means that the tensor product of x and y is equal to z, 
and this could either mean tensor product of objects or tensor product of morphisms. 

The symbols assoc, comm, and unit stand for the requisite natural isomorphisms 
present in an abelian tensor category. For instance, assoc^^ = t expresses that the 
associativity isomorphism (x <g> y) <g> z ~ x ® (y <8> z) attached to the objects x, y and z 
is equal to t, and similarly for comm ia = t. unit^ = y expresses that y is the natural 
isomorphism between the object x and x <g> 1, where 1 is an identity object for the 
tensor category. 

The reader should take care not to automatically identify the symbol = occurring 
as a sub-symbol of the above symbols with actual equality of elements; = is purely 
formal in this context. For a random structure in the above signature it is entirely 
possible to have four elements a, b, c, d such that a o b = c, a o b = d, but not c = d, 
where this last equation is actual equality of elements. Of course, we chose the symbol 
= because, modulo the theory we are going to write down, = does in fact behave like 
equality. 

To make anything we are about to do manageable, we must find a way to treat 
certain of the relational symbols in our language as if they were functional right from 
the start, and hence to treat expressions such as x o y as if they were terms. For 
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instance, we would like to be able to write the sentence 



(Vx,y)(xoy = yox) 



and attach the intended meaning to it. But as it stands, this is not a sentence in our 
language. Here is how this can be remedied. In the case of o we can treat the symbol 
ioi/asa term as follows. If $(z) is any formula, x and y variables, then we define 
$(i o ?/) to be the formula 



where t is some variable not occurring in $ and not equal to x or y. By iteration of this 
process we can in fact treat any 'meaningful composition' of variables as a term. By 
meaningful composition we mean: any variable a; is a meaningful composition, and if 
VP, E are meaningful compositions, then so is (\P)o(E) (for instance, ((xoy)oz)o(sox) is 
a meaningful composition). Then for a meaningful composition (\&) o (E) and formula 
3>(x), we define o (E)) by induction on the length of the composition to be 



where s, t and r are some not already being used variables. This formula is well- 
defined, since the formulas s — \P, t — E, and $(r) are by induction. For example 
then, the formula x o y = y o x literally translates to 



The same trick can obviously be applied to the symbols involving +, eg), assoc, comm, 



(yt)(xoy = t 



*(*)) 
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and unit. Thus we can be confident in the meaning of something like 

(\/x,y,z)((x + y) o z = (x o z) + (y o z)) 

Let us agree on some abbreviations. All capital English letter variables (A, B, X, Y, 
etc.) are understood to range over objects, lower case Greek letters (<f>,ip,ct, f3, 
etc.) over morphisms, and if we wish to be nonspecific we will use lower case En- 
glish letters (a,6, x,y, etc.). So if $(x) is a formula, we define (VX)$(X) to mean 
(Vx)(x G Ob =>• $(x)), and (\/ip)$(ifj) means (Vx)(x G Mor =^ $(x)). The 
formula (3X)$(X) stands for (3x)(x G Ob A $(x)), and similarly for (3ip)^(ip). 
(Vx)<E>(x) and (3x)<E>(x) mean exactly what they say. 

If ai, . . . , a n , x, y are variables then ai,...,a n : a; — > y is shorthand for ai : 
a; — > y A . . . A a n : x — >■ y. (Vai,...,a n : x — >■ y)$(ai, . . . , a„) is shorthand 
for (Vai,...,a n )(ai,...,a n : x ->■ y =>• $(ai, . . . , a„)), and (3ai,...,a„ : x ->■ 
y)$(ai, . . . , a n ) is shorthand for (3ai, . . . , a n )(ai, . . . , a n : x — > y A $(ai, . . . , a n )). We 
make identical definitions for the expressions Xi, . . . , x n G Ob and Xi, . . . , x n G Mor. 

If x and y are variables, we define the formula Dom(x) = y to mean (3z)(x : 
y — > z), and we make an analogous definition for Codom(x) = y. We can treat Dom 
and Codom as if they were functions by declaring: if $(x) is a formula, we define 
$(Dom(x)) to mean (Vy)(Dom(x) = y =>- $(y)), and similarly for Codom. 

The remainder of this chapter is devoted to proving, piecemeal, that the statement 
"is a tannakian category" is expressible by a first-order sentence in the language of 
abelian tensor categories. 
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.2 Axioms for a Category 

1. Every element of C is either an object or a morphism, but not both: 

(yx)({x G Ob V x G Mor) A ->(x G Ob A x G Mor)) 

2. All arrows are morphisms, and all vertices are objects: 

(\/x,y,z)(x : y ->■ z =>- (x G Mor A y E Ob AzG Ob)) 

3. Every morphism points to and from exactly one object: 

(V0)(3!X,F)(0:X^F) 

4. Composition only makes sense on morphisms: 

(Vrr, y, z)(x o y = z ==>- x,y,zE Mor) 

5. Composition only makes sense between composable morphisms, and the 
position points where it should: 

(V0, ip, r))((f) o if; = t] ==>- (Codom(0) = Dom(^) 
At] : Dom(0) ->■ Codom(^))) 

6. Composition is a function on composable arrows: 

(V0,^)(Codom(0) = Dom(^) (3!??)(0 o ^ = 77)) 
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7. Composition is associative: 



(Var, y, z)((3t)((x o y) o z — t) =>- (x o {y o z) = (x o y) o z)) 



We define the formula x = l y to mean that x is a two-sided identity morphism for 
y, i.e. as the formula (x : y — > y) A (Wz)((xoz = tVzox = i) ==>- z = t). We write 
x = 1 to mean that x is an identity morphism for some object, i.e. (3X)(x = lx)- 
If $(x) is a formula, we define $(l z ) to mean (Wy)(y = l x ==>■ $(?/)), and similarly 
for 1. 

8. Every object has an identity morphism: 



(VA)(30)(0 = 1 X ) 



4.3 Axioms for an Abelian Category 

In this section we build axioms amounting to the statement that a given category 



is abelian, using definition 3.2.1 as our guide. 



1. Addition is only defined on addable morphisms, and their sum points where it 
should: 

(Vx, y, z)(x + y = z ==>- (x, t/,z 6 Mor A Dom(x) = Dom(y) A Dom(y) 
= Dom(z) A Codom(x) = Codom(y) A Codom(y) = Codom(z))) 



2. Addition is a function on addable morphisms: 



(Vx, y)((Dom(x) = Dom(y) A Codom(x) = Codom(y)) ==>- (3\z)(x +y — z)) 
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3. Addition is associative and commutative: 



(Vx,y, z)((3t)((x + y)+z — t) =>- (x + y = y + x A (x + y) + z = x + (y + z))) 

Define the formula x = V;Z to mean x is an additive identity for Hom(x, y). That 
is, (x : y ->■ z) A (Vf : y ->■ + t = t). Define x = to be (3X, = x ,y). If 
$(x) is a formula, $(0 X)J/ ) means (Vz)(z = X:V =>• $(^)), and similarly for 0. 

4. Existence of zero morphisms for addition: 

(VA,B)(3<P)(x = AB ) 

Define the formula x = —y to mean x is an additive inverse for y. That is, x+y = 0. 
For a formula <&(#), $(— y) is shorthand for the formula (Vx)(x = — y =>- $(a;)). 

5. Existence of additive inverses: 

W)(ix)(x = -4) 

6. Bilinearity of composition over addition: 

(VA,B,C,D)(Vri,(f),ilj,v)((ri : A -> B A <j>,ip : B -> C A v : C -> D) 
(r]o((/> + ip) — r]0(f> + r]ot/}A ((f) + ip)orj — (f)or] + ipo rfj) 

The definition of an abelian category calls for the existence of pair-wise biproducts, 
kernels and cokernels, and normality of monomorphisms and epimorphisms. Here we 
give first-order definitions of these concepts. 
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Let A and B be objects. Then a biproduct, which we denote as A © B, is a 
diagram 



A B 




A B 



with the following properties: tta ° la + Kb ° £b = J-Aes, 1,4°^ = 1a, lb ttb = 1b, 
l a o ti b = 0, and o 7Tyi = 0. One could clearly write down these conditions as a 
first-order formula. Thus, for objects A and B we define the formula Sum(Z; A, B) 
to mean that there exists maps la, ib, 71 a, ^b satisfying all the above criteria. 

Let ip : A — > B be a morphism. A kernel for vp is by definition a map k pointing 
from some object K to A such that k o ip = KB , and for any object C and map 
p : C — > A with p otp — C B there is a unique map p : C — >■ K such that p o k = p. 
Again, this is clearly first-order. Thus, we define: ker(A;; iji) means that the morphism 
A; is a kernel for ip. The same obviously holds for the dual concept of cokernel, so we 
define coker(c; ip) in like fashion. 

In the language of categories saying that a morphism is an epimorphism is to say 
that it is right-cancellative. That is, ip : A — > B is an epimorphism if for any maps 
r],i/ : B — >■ C ijj o r) — ijj o v implies that rj = v. Again, this is clearly a first-order 
concept, so we define the formula epic(0) to mean is an epimorphism, and likewise 
monic(0) that is a monomorphism. 

7. Every pair of objects has a biproduct: 



(VA,B)(3Z)(Sum(Z;A,B)) 
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8. Every morphism has a kernel and a cokernel: 

(V0)(3£;,c)(ker(£;;0) A coker(c;</>)) 

9. Every monomorphism is normal: 

(V0)(monic(0) ==>• (3-0)(ker(0; -0)) 

10. Every epimorphism is normal: 

(V0)(epic(0) =>- (3-^)(coker(0; -0)) 

4.4 Axioms for an Abelian Tensor Category 

Here we build axioms asserting that given abelian tensor category is an abelian 



tensor category, per definition |3.2.2| Our first task is to assert that © is a bi-additive 
functor. 

1. Every pair of morphisms and objects has a unique tensor product: 

(VX, Y)(3\Z)(X ®Y = Z) A (V</>, ^)(3!t/)(0 <8> V = r/) 

2. The tensor product of objects is an object, that of morphisms is a morphism, 
and there's no such thing as a tensor product of an object and a morphism: 

(VX, Y) {X®Y e Ob) A (V0, V) (</>©?/> G Mor) A (VX, ($x) (X©</> = ^ V </>©X = x) 
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3. The tensor product of morphisms points where it should: 
(V(f>,ilj)(VA,B,X,Y)((<t> : A ->■ B Aip : X ->■ F) =^ (00^:^0X^5®^)) 

4. The tensor product preserves composition: 

(V(f),ip,ri,v)(VA,B,X,Y,S,T)((<l> : A ^ X A i/j : B ^ Y A rj : X ^ S A u :Y ^ 

(((f) or])®(ipo v)) = ((f) ® ^) o (77 <g> 1/))) 

5. The tensor product preserves identity: 

(Vi,B)((lA®l B ) = W) 

6. The tensor product is a bi- additive functor: 

(VA5,X,r)(V0,^ : A ->■ S)(V?7 : X ->■ F) 
(((0 + V ; )®^ = 0®^ + V ; ®^) A (rj ® ((j) + = rj ® (f) + rj ® 

Next we assert that the natural isomorphisms assoc, comm, and unit are doing 
the job we need them to. We start with assoc. 

7. assoc accepts objects and returns morphisms: 

(Vx, y, z, t) (assoc^^ = t ==>- (x,y,z G Ob A £ G Mor)) 

8. assoc is a function on triples of objects: 

(VX,F,Z)(3!0)(assocx,y,z = 0) 
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If <&(x) is a formula and a,b,c variables, by $(assoc aj ;, )C ) we mean the formula 
(Vt)(assoc aAc = * =>. $(t)). 

9. assoc points where it should: 

(VX, Y, Z)(assoc Xi y jZ : X <g> (Y <g> Z) ->• (X <g) Y) <g> Z) 

We define a formula iso(0) to mean that the morphism is an isomorphism: 
(VA, B){(f>:A^B (3-0 : B ->■ A)(0 o -0 = 1 A A ip o = 1 B )). 

10. assoc is always an isomorphism: 

(VX, Y,Z) (iso(assoc x ,y,z)) 

11. assoc is a natural transformation: 

(VX, Y, Z, R, S, T)(V0 : X ->■ i?, V : Y ->■ 5, 77 : Z ->• T)(the following commutes: 

x ® (y ® z) assocx - y -> (x ® y ) ® z 



V V 

R®(S®T) > (R®S)®T 

assoc RtS ,T 

We make the necessary assertions and definitions for comm and unit in like fashion. 
12. comm accepts objects and returns morphisms: 

(Vx, y, z)(comm xy = z =>- (x,y G Ob Az6 Mor)) 
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13. comm is a function on pairs of objects: 

(VX,F)(3!0)(comm x , y = 0) 

If <&(#) is a formula and a, b variables, by $(comm ai b) we mean the formula 
(Vt)(assoc aj6 = t => $(*)). 

14. comm points where it should: 

(VX, F)(comm Xi y : X <g> F ->■ F <g> X) 

15. comm is always an isomorphism: 

(VX,F)(iso(comm XiY )) 



16. comm is a natural transformation: 



(VX, Y, R, S) (V0 : X — )• _R, : F — )> 5) (the following commutes: 



R®S >S® R 

comm fliS 

17. unit accepts objects and returns morphisms: 



(Vz,y) (unite = y =>• (x G Ob A y E Mor)) 
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18. unit is a function on objects: 

(VX)(3!0)(unit x = 0) 
If <&(x) is a formula and a a variable, by $(unit a ) we mean the formula (Vt)(unit a = 

t =► $(0). 

19. unit is always an isomorphism: 

(VX)(iso(unit x )) 

For unit, we must make the additional assertion that there exists an identity object 
for <g). 

20. unit has an identity object associated to it: 

(3f/)(VX)(unit x :X->U®X) 

We define the formula id®(u) to mean that u is an identity object associated to 
unit. That is, (VX)(unit x : X ->■ u <g> X). 

21. unit is a natural transformation: 

(VX,F)(V0 : X -> F)(VC/)(id®([/) the following commutes: 



F 



unity 



[/(g)F 
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We must now assert that the functor X i— > id®(-u) ® X is an equivalence, which 
is to say that it is full, faithful, and essentially surjective. Essential surjectivity is 
already asserted by previous axioms: for every X, X is isomorphic to id®(«) ® X. 
Thus we must assert that it is full and faithful. 

22. The functor X H> id (u) ® X is full and faithful: 

(V</>)(VX,Y,[/)((id®([/)A</> : U®X ->• U®Y) (3!^)(^ : X -> X A = 0)) 



All that is left then is to assert the various coherence conditions among assoc, 



comm, and unit, conditions 4, 5, and 6 in definition 3.2.2 All of these statements 
are of the form, for some fixed n, "for all objects Xi, . . . ,X n , the following diagram 
commutes." These are plainly first-order, so we do not repeat them. 



4.5 Axioms for a Tannakian Category 



Here we must assert the rigidity of the abelian tensor category C (definition 3.2.3) 
and that End(l) is a field, modulo all of our previous axioms. The first condition for 
rigidity is the existence of an internal Horn object for every pair of objects (see page 



35 ). This is by definition an object Hom(X, Y) such that the functors Hom( , X®Y) 

and Hom( , Hom (X, Y)) are naturally isomorphic. Here is how we can define this 

in a first-order fashion. 

Suppose that Z is an internal Horn object for X and Y. Then we have a natural 
isomorphism of functors Hom( , Z) — > Hom( <g>X, Y). The Yoneda lemma (page 



18) guarantees that this map $ must take the following form: for an object T and 
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map T — > Z, $t(0) is the unique map making the diagram 



T®X 




commute, where we have given the name ev to the element $^(1^) G Hom(Z£g>X, Y). 
And in fact, any such map Z®X — > Y gives you a natural transformation between the 

functors Hom( , Z) and Hom( (g) X, Y). Thus, to assert the existence of a natural 

isomorphism, we need only assert the existence of a map ev : Z®X — > Y such that the 
natural transformation $ it defines gives a bijection Hom(T, Z) — ^> Hom(T ® X, Y) 
for every T. We therefore define the formula Hom(Z, ev; X, Y) to mean that the 
object Z and morphism ev form an internal Horn pair for X and Y: 



(ev : Z <g> X -»■ y) A (VT)(W : T <g> X -»■ y)(3!0 : T ^ Z)(the following commutes: 

T®X 




1. Every pair of objects has an internal Horn: 



(VX, y ) (3Z) (30) (Hom(Z, 0; X, K) ) 



Recall now the definition of reflexivity of the object X (see page 37); it is the 
assertion that a certain map ix '■ X — > X vv is an isomorphism. This map is defined 
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by the property that it uniquely makes 

X®X V 




X vv <g) X v > 1 



ev x v 

commute. But of course ix is not really unique, since neither is e.g. X v , since there 
are in general many (mutually isomorphic) choices for internal Horn. But for a fixed 
choice of the various internal Horn objects referenced in this diagram, it is unique. 
We therefore define incl(i; X) to mean that i qualifies as one of these maps: 

(3T, R, U) (3(f), V) (id® (U) A Hpm(T, 0; X, U) A Rom(R, tp; T, U) 

A i : X — > R A the following commutes: 
X®T 




R®T > U 

2. All objects are reflexive: 

(VX)(VO(incl(i;X) =^ iso(t)) 
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Our last task in defining rigidity is to assert that the map $ referenced in diagram 



3.2.3 is an isomorphism. First, given objects R, S,T and U, let us define the isomor- 



phism (R <g> S) <g> (T <g> U) ~ (R <g> T) g (S g £/") referenced in diagram |3.2.2| This is 
gotten by composition of the following sequence of commutativity and associativity 
isomorphisms (the subscripts of which we suppress) : 



(R <8> S) g (T (8) [/) 72 g ( 5 g (r g [/)) 



1® assoc 



> i? g ((£ g T) g [/) 



l®(co mm ®l) fl ^ ((T ^ g) ^ v) agsoc (fl ^ (T ^ g)) ^ y assoc®! > ^ y) ^ g) ^ ^ 

^ (i? T) (g) (5 (8) C/) 



Define the formula ISO(i?, S, T, U, \I/) to mean that \I/ is the above composition with 
respect to the objects R, S, T and U. Next we must define the map $ defined by 



the commutativity of diagram 3.2.3 This is done using a similar strategy to that 
used to define reflexivity. Define the formula QUAD(Xi, X 2 , Yi, Y 2 , $) to mean that 
$ qualifies as one of the maps referenced in diagram 3.2.3| with respect to the objects 
Xi, X 2 , Y\,Y 2 . 



(3Z 1 ,Z 2 ,Z)(3ev 1 ,ev 2 ,ev,^)(Hpm(Z 1 ,ev 1 ;X 1 ,F 1 ) A Hom(Z 2 , ev 2 ; X 2 , Y 2 ) 
AHpm(Z, ev; X l g X 2 , Y x g Y 2 ) A lSO(Z 1 , Z 2 , X h X 2 , *) A the following commutes: 
(Zi g Z 2 ) g (Xx g X 2 ) (Zi g Xi) g (Z 2 g X 2 ) 



*<g>i 



evi®ev2 



Z g (Xi g X 2 ) 



Y X ®Y 2 



3. For all objects X 1; X 2 , Yi, Y" 2 , the map $ in diagram 3.2.3 is an isomorphism: 



(VX 1; X 2 , Fi, F 2 )(V$)(QUAD(X 1 , X 2 , Yi, F 2 , $) 



$ is an isomorphism) 



74 



Finally, we have to assert that the ring End(l) is a field. Given any object X, our 
previous axioms already assert that End(X) is a ring with unity, so let field(X) be 
the assertion that End(X) is a commutative ring with inverses: 

(V0, ip : X ^ X)((f) o ip = ip o 0) A (V0 : X — >■ X)(-i(0 = 0) =^ o -0 = l x )) 

4. End(l) is a field: 

(VC/)(id®(C/) =^ field(C/)) 

We have proved that the statement "is a tannakian category" is expressible by a 
first-order sentence in the language of abelian tensor categories. 
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Chapter 5 

Subcategories of Tannakian 
Categories 

In this chapter we record some results on (abelian, tannakian) subcategories of 
(abelian, tannakian) categories which will be needed later. For instance we will 
prove useful criteria which allow us to conclude that a given subcategory of a neutral 
tannakian category is also neutral tannakian. 

Proposition 5.0.1. Let C be a full subcategory of the tannakian category V. Then C 
is tannakian if it is closed under the taking of biproducts, subobjects, quotients, tensor 
products, duals, and contains an identity object. 

Proof. We first show that C is abelian. C is a full subcategory of V, and so obviously 
Horn-sets still have the structure of an abelian group, and composition is still bilinear. 
Consequently, being a zero or identity morphism in C is coincident with being one in 
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T>. If A © B is the 22-biproduct of the C-objects A and B, then we have a D-diagram 



A B 




A B 



such that tt a o la + ttb ° ib = 1a@b, la ° ^a = 1a, t£ ° ^b = Is, «A ° 7Tb = 0, and 
tgo tta = 0. But all these maps exist in C as well, along with the given relations, so 
this diagram constitutes a C-biproduct for A and B. 

4> k 

Let A — > B be a C-morphism, and — )■ A its D-kernel; this map exists in C as 
well, since is a subobject of A. If L — > A is any C-morphism with tfj o <fi = in C, 

then this composition is zero in T> as well; consequently, there is a unique morphism 

^ — - 

L y K such that ip o k = if). As C is full, this map ip exists also in C, and is clearly 

still unique. Thus k is a C-kernel for <ft as well, which shows that all kernels exist in 

C. An analogous proof holds for the existence of cokernels, using the fact that C is 

closed under quotients. 

Let A — > B be a C-monomorphism; we claim that it is also a monomorphism 

in V. Let X — > A be any D-morphism such that ip o <\> = 0; we wish to show that 

ij} — 0. As every morphism in an abelian category factors through an epimorphism 

and a monomorphism (page 199 of p3j), we have a commutative diagram 



C 



X 




>B 
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where ir and i are a P-epimorphism/monomorphism respectively, giving 7r o l o = 0. 
As 7r is epic, we have i o = 0. But C is a subobject of A, hence a member of C, and 
so by the C-monomorphic property of 0, i = 0. Thus ip = tt o l equals as well, and 
we have shown that is a D-monomorphism. 

So if A — > B is any C-monomorphism, it is also a D-monomorphism, thus normal 
in V. Then let -B — > C be the D-morphism for which is the D-kernel. Again ip 
factors through an epimorphism and a monomorphism, and we have a commutative 
diagram 

X 



A > B > C 

The map B X exists in C, X being a quotient of B. We claim that is a C-kernel 
for 7T. If L —^-t- B is any C-map such that i] o 7r = 0, then also r]otjj = rio'KOL = 0; as0 
is a kernel for tp, there is a unique map L - > A such that fj o = r], which satisfies 
the universal property of being a C-kernel for n. Therefore all monomorphisms are 
normal in C. An analogous argument shows that all epimorphisms in C are normal. 
Therefore C is an abelian category. 

Since the tensor product of two objects in C is also in C, so also is the tensor prod- 
uct of two morphisms, since C is full. For objects A, B and C of C, the associativity 

map (A £3 B) <g> C A ' B,C > A® (B ® C) exists in C, and is clearly still natural. Just 

as 'monomorphic' and 'epimorphic' are identical concepts in C and T>, so is 'isomor- 
phic', and thus assoc is a natural isomorphism in C. Analogous statements hold for 
the requisite isomorphisms comm and unit, the latter existing in C since the identity 
element of T> is stipulated to exist in C. The coherence conditions 4., 5. and 6. of 



definition 3.2.2 clearly also still hold, as well as the bilinearity of ®. 
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In any tensor category, one can in fact identify the object Bx>m(A, B) with A y <g)B; 
as duals are assumed exist in C, so also do all internal Horns, as well as the requisite 



'ev' maps since C is full. The remaining conditions of definition 3.2.3 merely stipulate 
that certain maps must be isomorphisms; as C is full, these maps also exist in C, and 
are isomorphisms since they are in T>. And of course, End(l) is still a field. This 
completes the proof. 

□ 



Lemma 5.0.2. Let C be a full abelian subcategory of the abelian category T> which is 
closed under the taking of biproducts, subobjects, and quotients. 

1. The C-diagram 




is a C-pullback for X 2 — % Z X\ if and only if it is also a V-pullback for 



2. The C-diagram 




is a C-pushout for X 2 — Z X\ if and only if it is also a V-pushout for 

4>2 ry V>1 



X 



x 1 
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Proof. We will prove this for pullbacks, leaving the pushout case to the reader. Sup- 
pose first that 




is a "D-pullback diagram. Let T be an object of C, and suppose we have a commutative 
diagram 




Then by the universal property of being a "D-pullback, there is a unique D-map 
p : T — > X such that p o n 1 = p 1 and p o 7r 2 = p 2 - As C is full, this map p exists 
in C as well, satisfies these relations, and is clearly still unique. Thus this diagram 
constitutes a C-pullback as well. 

Conversely, suppose the above is a C-pullback diagram. As T> is abelian, we know 



that Xi 



02 



X 2 has a "D-pullback, say 




The proof of theorem 2.15 of [7] shows that U can always be taken to be, up to 
isomorphism, a certain subobject of X\ © X 2 . As C is closed under the taking of 
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biproducts and subobjects, U is an object of C, and as C is full, /xi and /i 2 are 
morphisms in C; thus, this diagram belongs to C. Then by the above, as this diagram 
is a D-pullback, so also is it a C-pullback. As any two pullbacks in an abelian category 
are isomorphic up to a unique isomorphism, we must have that 




is a D-pullback as well. □ 

Lemma 5.0.3. Let T> be an abelian category, C a non-empty full abelian subcategory 
of T> . Then exact sequences in C are also in T> if and only if for every morphism 
A — B in C, there is a V-kernel of <p, V-cokernel of 4>, and V-direct sum which all 
lie in C. 

Proof. See theorem 3.41 of [7]. □ 

In the case of 1-fold extensions in a /c-linear abelian category, we shall need some- 
thing slightly stronger. 

Proposition 5.0.4. Let V be a k-linear abelian category, C a non-empty full k-linear 
abelian subcategory ofT>. Let M and N be objects ofC and let £i, . . . , £ m be a sequence 
of 1-fold extensions of N by M in C. Then the £j are linearly independent in C if and 
only if they are linearly independent in T>. 

Proof. By 'linearly independent', we mean with respect to the k- vector space structure 



defined by the Baer sum on Ext 1 (M, N) (see section UM. 

Denote by EXT^ (M,N) the collection of all 1-fold extensions of M by N in C, 
and define similarly EXT^,(M, N) (this is different from Ext c (M, N), which is the 
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collection of all equivalence classes of 1-fold extensions). By the previous lemma, for 
every £ £ EXT^M, N), £ is also a member of EXT^,(M, N), whence we have a map 
EXT^(M, N) ->■ EXT^(M, JV). 

We claim firstly that this map respects equivalence of extensions. If £ : — > N — > 
X — > M — > 0, x '■ ® — > N — > Y — > M — > are two C-equivalent extensions, then we 
have a C-isomorphism (ft : X — > Y making 

£ : >N >X > M >0 

X : > N >X > M > 

commute. But as is a ©-isomorphism as well, £ and x are a l so ©-equivalent. 
Thus, our map EXTc(M,iV) ->■ EXT^(M,N) is actually a map Extc(M,7V) ->■ 
Extp(M, TV). This map is injective, for if £ and x are ©-equivalent according to the 
above diagram, then they are also C-equivalent, since the map (ft exists in C. 

What is left then is to verify that this map is linear. Let k be a scalar and £ : 
— > iV X — M — > a C-extension of M by TV. Then the scalar multiplication 
of k, when k ^ 0, is defined to be 

i(:0-)iV fc 4lAM40 

and in case k = 0, as the trivial extension. As scalar multiplication of morphisms and 
trivial extensions are defined the same way in C as in V, so also is scalar multiplication 
of extensions. 

Let £, x be two extensions. To compute the Baer sum £ © we are asked to 
compute a certain pullback, to compute a pair of unique maps pushing through a 
pullback, to compute a certain cokernel of one of these maps, to compute a unique map 
pushing through a cokernel, and finally to compute a certain composition. Lemma 
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5.0.2, as well as the proof of lemma 5.0.1, show that all of these constructions lead to 
the same answer whether done in C and D. We conclude that if £ © x = V m so 
also does this equation hold in D. This completes the proof. □ 

Lemma 5.0.5. A full tannakian subcategory of a neutral tannakian category is also 
neutral (over the same field and via the restriction of the same fibre functor). 

Proof. We are given a neutral tannakian category D with fibre functor u : T> — > Vec/j, 
where k is the field End(l). We want to show that this functor restricted to C, which 
we still call u, qualifies as a fibre functor on C. Looking at the conditions of definition 



3.2.5| it is easy to verify that co restricted to C is still a tensor functor. The requisite 
isomorphism cx,y '■ oj{A <g> B) ~ w(A) <g> oj(B) still exists, is natural, and still satisfies 
the relevant diagrams, since e.g. assoc in C is the same as assoc in T>. Faithfulness 
and fc-linearity are also clearly still satisfied; all that remains to check is exactness. 



By proposition 5.0.1, C and T> satisfy the hypothesis of lemma 5.0.3, and thus exact 
sequences in C are also in T>. As w preserves exact sequences in "D, so must it also 
when restricted to C. □ 
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Chapter 6 

Some Ultraproduct Constructions 



In the next chapter we shall be studying ultraproducts of tannakian categories. In 
this chapter we define several ultraproduct constructions and record several results on 
them that will soon be necessary; the learned reader may wish to treat this chapter 
merely as a reference. The reader may also consult the appendix for a review of 
ultrafilters and ultraproducts in general. 

In this dissertation, if M, is a collection of relational structures in a common first- 
order signature, indexed by /, and if U is a non-principal ultrafilter on /, we denote by 
]l w Mi the ultraproduct of those structures with respect to U. For a tuple of elements 
(xi) from the Mj, we denote by [xj] its equivalence class, that is, its image as an 
element of IIw-^- When we make statements like "the ultraproduct of vector spaces 
is a vector space over the ultraproduct of the fields" , it will always be the case that 
these ultraproducts, both for fields and vector spaces, are being taken with respect 
to the same fixed ultrafilter (as indeed it makes no sense to assume otherwise). 

6.1 Fields 

Let ki be a sequence of fields indexed by /. We treat the fc, as structures in the 
language +, *, — ,0, 1 with the obvious interpretation. For brevity we write the term 
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x * y as the juxtaposition xy. As always we fix a non-principal ultrafilter U on / 
throughout. 

Proposition 6.1.1. is afield. 

Proof. Simply realize that the axioms for a field are first-order sentences in this lan- 
guage: 

1. (Vs, y, z)((x + y) + z — x + (y + z) A x{yz) = (xy)z) 

2. (Vx, y) (x + y = y + x A xy = yx) 

3. (Vx)(lx = xA0 + x = x) 

4. (Vx)0 + (-x) = 0) 

5. (Vx, + z) = xy + xz) 

6. (Vx)Hx = 0) => (3y)(rcy = l)) 

7. -(1 = 0) 



Now apply corollary CO. 16 



□ 

Proposition 6.1.2. Suppose that ki is a sequence of fields of strictly increasing pos- 
itive characteristic. Then Yl u ki has characteristic zero. 

Proof. For a fixed prime p, let char p be the statement 1 + 1 + .. . + 1 = (p-occurrences 
of 1). As the ki have strictly increasing characteristic, for fixed p, char p is false in all 
but finitely many of them. Thus -ichar p holds on a cofinite set, which is always large, 
and so -ichar p holds in the ultraproduct. This goes for every p, which is equivalent 
to Yl u ki having characteristic zero. □ 
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6.2 Vector Spaces 

Let Vi be an indexed collection of vector spaces over the fields hi. We treat the 
Vi simply as structures in the signature +, 0, i.e. as abelian groups, forgetting for 
the moment the scalar multiplication. Then Y\ u Vi is also an abelian group in this 
signature, and the addition is of course given by [vj\ + [wi] = f [vi + Wj\. 

Let k = n u ki be the ultraproduct of the fields fcj as in the previous section. We 
assume as always that both of these ultraproducts are being taken with respect to 
the same fixed non-principal ultrafilter. 

Theorem 6.2.1. Yl u Vi is a vector space over Yl u ^i> under the scalar multiplication 

N [Vi] = [OiVi] 

Proof. The given multiplication is well-defined: If (a*), {pi) are equal on the large set 
J, and if (v^, (wi) are equal on the large set K, then (a^j) and ipiWi) are equal on 
at least the large set JDK. It is routine to verify that this definition satisfies the 
axioms of a vector space. □ 

Proposition 6.2.2. The finite collection of linear equations of the form 

Ml Ml + . . . + [<ii]„[^]n = [o] 

is true in Y\ u Vi if and only if the corresponding collection of linear equations 

aj,it>i,i + . • • + a ijn Vi !n = 

is true for almost every i. 

Proof. For the forward implication, the claim is obvious in the case of a single equa- 
tion, since the first equation is equivalent to [a i; iv i;1 + . . . + a^ n v i>n \ = [0]. For a 
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finite set of equations, each individual equation holds on a large set, and taking the 
finite intersection of these large sets, we see that there is a large set on which all the 
equations hold. The reverse implication is obvious. □ 

Proposition 6.2.3. A finite set of vectors [e«]i, . . . , [ei\ n is a basis for W u Vi over 
Y\ u ki if and only if, for almost every i, the set of vectors e^i, . . . , e^ n is a basis for Vi 
over ki . 

Proof. Given a linear dependence [eij]i[ej]i + ... + [aj] n [ej]„ = [0] in fJ^Vi, we get 
a linear dependence for almost every i, by the previous proposition. If [aAj ^ [0], 
then djj 7^ for almost every i, and by taking another intersection we get a non- 
trivial dependence in almost every i. Conversely, if we have a non-trivial dependence 
ai,i e i,i + - • ■ + 0'i,n^i,n = in almost every i, the equation [aj]i[ei]i + . ■ ■ + [aj] n [ei]„ = [0] 



holds in n^V^. By lemma B.0.9, at least one of the [ai\j must be non-zero. 

If ey span Vi for almost every i, then for every [vi\ G riu^> we nave an almost 
everywhere valid equation a^ie^i + . . . + ai tn e^ n = Vi in Vi, which projects to an 
equation [oj]i[ej]i+. • - + [ai]n[ei]n = K], showing that the [e^]j span n w ^- Conversely, 
if the ejj, almost everywhere, do not span Vi, choose for each of those slots which 
are not in the span of the e^j. Then neither can [vi] be in the span of the [ei\j, lest 
we project back to an almost everywhere linear combination for Vi in terms of the 
ey. □ 

Proposition 6.2.4. For a fixed non-negative integer n, Y\ u Vi has dimension n over 
Yl u ki if and only if almost every V, has dimension n over ki. Yl u V, is infinite dimen- 
sional over Ylu^i if an d only if, for every n, almost every Vi does not have dimension 
n over ki. 

Proof. Apply the previous proposition. □ 

We see then that if Vi is almost everywhere of dimension n < oo, it does no harm 
to assume that it has dimension n everywhere. We call such collections constantly 
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finite dimensional or sometimes boundedly finite dimensional. With few ex- 
ceptions it is these types of collections of vector spaces we will be concerning ourselves 
with. 

6.2.1 Linear Transformations and Matrices 

For a collection of vector spaces V i: Wi and linear maps fa : Vi — >■ Wi, denote by 
[fa] the linear map Y\ u Vi — > Ylu^i defined by 

[fa]([ Vi ]) = [fa( Vi )] (6.2.1) 

Denote by ]l M Hom fe .(Vi, Wi) the collection of all such transformations of the form 
[fa]. So long as the Vi are constantly finite dimensional, we are justified in using this 
notation because 

Proposition 6.2.5. If the Vi are of constant finite dimension, [fa] = [ipi] as linear 
transformations if and only if, for almost every i, fa = ipi as linear transformations. 
Further, [fa] o [^] = [fa o fa], [fa] + [fa] = [fa + fa], and for an element [a,-\ of Y[ u ki, 

M [fa] = [aifa] ■ 

Proof. The 'if direction is obvious. For the converse, let [ej]i, . . . , [ej] n be a basis 
for whence, for almost every i, e^i, . . . , e i>n is a basis for V^. [fa] = [ipi] if 

and only if they agree on this basis, so let J m , m — 1 . . . n be the large set on which 
fa(e™) = ipi^e™). Then the finite intersection of these J m , on which fa and ipi agree 
on every basis element, thus on which fa = ipi, is large. The last three claims of the 
proposition are now obvious. □ 

If the Vi are of unbounded dimensionality, the theorem does not hold; the proof 
falls apart when we try to take the intersection of the J m , which in this case may well 
be an infinite intersection, and not guaranteed to be large. 
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Proposition 6.2.6. If V it Wi are constantly finite dimensional collections of vector 
spaces, then 

Hom^iUVuUWi) ^ UHom^Wi) 

u u u 

Proof. It is easy to verify that the right hand side of the above claimed isomorphism 
is always included in the left hand side (even if the Vi or Wi are not boundedly finite 
dimensional). For the other inclusion, pick bases [e\], . . . , [e™] and [//], . . . , [//"] of 
W u V,i and respectively. For a linear transformation <fi in the left hand side of 

the claimed isomorphism, write it as an n x m matrix 



r 

V h J 



K } 



[c 



J 



in the given bases. Then one can verify by hand that is of the form [0j] , where each 
(pi is the transformation Vi — > Wi given by the matrix 



/ 1,1 



1,71 \ 



m,l 



n,m 



in the bases ej,...,e?, 



□ 



We can therefore always assume that a linear transformation Yiu^i ~^ Ylu^i 1S 
uniquely of the form [0j], so long as the Vi and Wi are of constant finite dimension. 
This theorem is not true if the Vi and Wi are not both boundedly finite dimensional; 
the forward inclusion fails. 



Definition 6.2.1. Let be a sequence ofnxm matrices over the fields k i: given 
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by 



/ 1,1 

a/ 



m,l 
CL; 



\ a i ■■■ a i ) 

Then we define the ultraproduct of these matrices, denoted [Mi], to be the n x m 
matrix over the field Y\ u ki gi ven by 



/ r i,ii 



V 



r m.ll 

h J 



/ 



Proposition 6.2.7. Let Vi, Wi have constant dimension n and m, with bases e], . . . , e" ; 
/?•,..., / 2 m respectively. If <pi '.V\ —> Wi is represented by the n x m matrix Mi in the 
given bases, then [4>i] is represented by the matrix [Mi] in the bases [e\], . . . , [ei],[f}], . . . , 



Proof. Obvious. 



□ 



Applying propositions 6.2.6 and 6.2.7 together yield 



Corollary 6.2.8. For integers m and n and fields hi, 



Mat n , m (X[ki) ~ Y\Mat^ m (ki) 

u u 

where the latter stands for all matrices of the form [Mi], Mi e Mat n ^ m {ki). 

As we have seen, an ultraproduct of linear transformations preserves composition, 
addition, and scalar multiplication. By induction on complexity, it thus also pre- 
serves any equation involving a finite combination of these three operations, and by 
considering a finite intersection of large sets, the same is true for any finite collection 
of such equations. We state this as a theorem. 
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Theorem 6.2.9. Let [<f>i]i, . . . , [4>i] n be a finite collection of linear transformations, 
all between ultraproducts of constantly finite dimensional vector spaces. Then a finite 
collection of equations among the [(pi\j involving addition of maps, composition, and 
scalar multiplication is valid if and only if the corresponding collection of equations 
among the (pn, . . . , (pi n is valid almost everywhere. 



By proposition 6.2.7, the same is true for matrices: 



Corollary 6.2.10. The same is true for a finite collection [Mj]i, . . . , [Mi\k of matrices 
over Yl u ki, if we replace 'addition of maps', 'composition', and 'scalar multiplication 
of maps ' with 'addition of matrices ', 'multiplication of matrices ', and 'scalar multi- 
plication of matrices'. 

Proposition 6.2.11. Over collections of constantly finite dimensional vector spaces, 
ultraproducts preserve injectivity, surjectivity, kernels and cokernels. 

Proof. Let Vi Wi be a collection of linear maps. Suppose first that almost every 
4>i is injective. Then if [vj\ ^ [0], Vi ^ for almost every i, and taking the intersection 
of these two large sets, <pi{vj) ^ for almost every i, which is the same as saying 
[</>*] (ND 7^ [0]- Thus [cpi] is injective. 

For the converse, we must use the constant finite dimensionality of the Let 
[ei\i, . . . , [ei\ n be a basis for so that e «,i' • • • ' e i,n is a basis for almost every Vi, 

say on the large set J. Suppose that almost every (pi is not injective, say on the large 
set K , and for each i 6 K let Vi = a% ie, i + . . . + a« n be a non-zero vector such that 



;i(vi) = 0. Then at least one of the is non-zero for each i. By lemma B.0.9 , at least 
one of [a,i]j is non-zero in n w ^*- Then we see that [0i]([ai]i[ei]i + . . . [aj] n [ej] n ) = [0], 
but [oj] m 7^ 0; hence [(pi] is not injective. 

The proof of surjectivity is similarly proved, using instead the constant dimen- 
sionality of the Wi. 
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Suppose that, for almost every i, fa is a kernel map for ipi. This is the assertion 
that (pi is injective, that everything in the image of fa is killed by ipi, and that nothing 
outside the image of (pi is killed by ip{. That [fa] is also injective has already been 
proved. To say that [vj\ is in the image of [fa] is equivalent to saying that v j is in the 
image of fa for almost every i, and to say that [ipi] kills [wi] is equivalent to saying 
that ipi kills Wi for almost every i; the same goes for their negations. 

The case of cokernels is proved similarly, using instead the fact that surjectivity 
is preserved. □ 

Proposition 6.2.12. Over collections of constantly finite dimensional vector spaces, 
the collection of diagrams 



o ->■ x; . . . x™ +1 



is almost everywhere exact if and only if the corresponding sequence 



[o]^[ Xl ] 1 ^h...^h[x l ] n+1 ^[o] 



is exact. 



Proof. The assertion that the sequence 



X Ay 



is exact amounts to the assertion that is a kernel for ip, minus the requirement 



that (p be injective, which (the proof of) proposition 6.2.11 shows to be preserved 
by ultraproducts. Checking that the above sequences are exact amounts to checking 
finitely many sub-sequences of this form, which is preserved by ultraproducts. □ 
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6.2.2 Tensor Products 

Proposition 6.2.13. Let Vi,Wi be (not necessarily boundedly finite dimensional) 
collections of vector spaces. Then there is a natural injective map 



UK ® rw Wi ® Wi 



given by [vj\ <8> [wi] h- > [i^ (g> Tne image of $ consists exactly of those elements 
having bounded tensor length. The map $ zs an isomorphism if and only if at least 
one of the collections Vi or Wi are of bounded finite dimension. 

Proof. That the map $ is well-defined is easy to verify, remembering of course that 
the same ultrafilter applies to all ultraproducts under consideration. Injectivity is 
likewise easy to verify. Any element on the left hand side, by the very definition of 
tensor product, has bounded tensor length, and hence so must its image on the right 
hand side. Conversely, if E?=i v ij ® w ij] is an element of bounded tensor length on 
the right hand side, then ^2™ =1 [vi]j <8) [wj\j is a pre-image for it on the left. 

To prove the isomorphism claim: if V and W are vector spaces of finite dimension 
n and m respectively, then the maximum tensor length of any element of V <g> W is 



min(n, m) (see lemma 9.1.4). Then if say Vi is of bounded finite dimension n, any 
[xi] e riw^i ® Wi is almost everywhere a sum of no more than n simple tensors, and 
is in the image of $. 

Conversely, suppose neither of Vi or Wi are of bounded dimension. For each i, 
choose X{ G V. <8> Wi such that Xi is of maximum possible tensor length; we claim 
that [xi] G \\ u Vi ® Wi is of unbounded tensor length, hence not in the image of 
If not, say the tensor length of Xi is almost everywhere bounded by n. This gives 
a large set on which the statement "at least one of Vi or Wi has dimension n" is 
true. This large set is covered by the union of {i G / : V, has dimension n} and 



{i £ I : Wi has dimension n}, so by lemma B.0.9, at least one of them must be large. 
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This gives a large set on which at least one of Vi or W,- L is of bounded dimension, a 
contradiction. □ 

This next proposition justifies our calling the map $ 'natural'. 

Proposition 6.2.14. IfVi, Wi, Xi, Yi are collection of vector spaces, and fa : Vi — > 
Xi, tpi :Wi —> Yi linear maps, then the following commutes: 



u u 



[4>i]®bl>i, 



[4>i 



(6.2.2) 



l{X i ®llY i ->llX i ®Y i 



u u 



Proof Let [vi] <S> [wi] be a simple tensor in Yl u Vi <S> Yl u Wi. Chasing it both ways gives 
the same result: 



N <8> [wi 



[Vi <g) Wi] 



[4>i®wi 



[fa{Vi)\ <g> l4>i(Wi)] —> [<f>(Vi) <g> ^i(Wi)] 



□ 



6.3 Algebras and Coalgebras 

If (Li, multj) is a collection of algebras over the fields ki then it is easy to verify 
that Ylu^i * s an algebra over the field n«^«> under the obvious definitions of addition, 
multiplication, and scalar multiplication. The multiplication on Ylu^i * s m particular 
defined as the composition 

mult : YIU ® Y\Li A ® U [ ^ ] \\U 

u u u u 
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Alas, for coalgebras, things are not so easy. Here is what can go wrong. Suppose 
(Ci,Ai,Ei) is a collection of coalgebras over the fields hi. Then n«a ^ s a ^ l eas t a 
vector space over n w ^«- New let us try to define a co-multiplication map A on n«a- 
We start by writing 

a : na na ® a 

u u 

But as it stands, this won't suffice; we need A to point to n u Cj <S> T\ u Ci. As is 
shown in proposition |6.2.13[ unless the Cj are of boundedly finite dimension, we only 
have an inclusion n^a ® EL a EL a ® a> wnose image consists of those 

elements of bounded tensor length, and for a typical collection Cj of coalgebras it is 
usually a simple matter to come up with an element [c^] G n^a suc h that [Aj(q)] 
has unbounded tensor length. Thus, the A constructed above cannot be expected to 



point to riw^i ® ri M C« i n general (this problem is dealt with at length in section 9.1 ). 

Nonetheless, if the C{ are of boundedly finite dimension, then the map $ is an 
isomorphism, whence we can define 



a : na na ® a ^> na ® na 

WW u u 



Likewise, we define a co-unit map by 



e : na n** 



and we have 

Proposition 6.3.1. If(Ci,Ai,£i) is a collection of boundedly finite dimensional coal- 
gebras over the fields ki, then Y\ u Ci is a coalgebra over the field Y\ u ki, under the 
definitions of A and e given above. 
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Proof. We must verify diagrams 2.1.1 and 2.1.2 of definition 2.1.2 Consider 





u u 



Commutativity of the top middle triangle follows from the everywhere commutativity 



of it, which is diagram 2.1.2 applied to each Cj. The rest of the subpolygons are easy 



to verify, whence we have commutativity of the outermost, which is diagram 2.1.2 



Diagram 2.1.1 can be proved in a similar fashion. 



□ 



Proposition 6.3.2. Let Li be a collection of boundedly finite dimensional algebras 
over the fields fcj. Then there is a natural isomorphism of coalgebras 



which sends the tuple of functionals [fa : Li — y ki] on the left to the functional [fa] : 
H u Li -y Y\ u ki on the right. 

Proof. Call the claimed isomorphism That it is an isomorphism of vector spaces 
is clear from proposition |6.2.6 To see that it is a map of coalgebras we must verify 
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commutativity of 



m 

u 
[Ai] 

m ® a° 



aw 



rra ® rre ^ aw ® aw 

u u u u 



where A, denotes the coalgebra structure on L° and A that on (J\ u Li)° ■ Let mult* 
be the multiplication on the algebra Lj and mult that on so by definition 

mult = $ o [multj]. 

Let [ccj : L { — > kj\ be an arbitrary element of Ylu^i an< ^ ^ us chase it both ways. 
Working downward first, we ask how $ _1 ([Aj] ([««])) acts on Y\ u Li ® HL^' it does 
so by the composition 



u u u u u 



Next we ask how A(\P([a!j])) acts on Y\ u Li ® HL^- ^ does so by the composition 



ULi ® ^ n** 



u u 



But mult is defined to be $ o [multj], and so these actions are equal. This completes 
the proof. □ 
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Chapter 7 

The Restricted Ultraproduct of 
Neutral Tannakian Categories 

In this chapter we prove one of the main theorems of this dissertation, namely 
that a certain natural subcategory of an ultraproduct of neutral tannakian categories 
is also neutral tannakian. 

7.1 Smallness of the Category Rep^G 

Before beginning in earnest, we pause in this section to address a subtle but 
important point. If one wishes to consider an ultraproduct of a collection of 'things', 
those things must be sets; in particular, they must be relational structures. Thus, 
if one wishes to consider the ultraproduct of a collection of categories, then those 
categories must be small categories, and the necessary (abelian, tensor, etc.) structure 
on the categories must be realized as actual relations and functions on that set. This 
forces the question: for an affine group scheme G and field k, can Rep^G be taken to 
be small, up to tensorial equivalence? 

This question is not fully addressed in this dissertation, but we shall at least 
give here some arguments that lead us to believe that this is a fair assumption. In 

98 



particular, we shall argue why we believe that the category Vec^ can be taken to be 
small, up to tensorial equivalence. Similar arguments we believe should apply to the 
category Rep fc G for arbitrary k and G. 

For the remainder of this section, we shall use the term small and tensorially small 
in an abusive sense; the category C shall be said to be small if it is equivalent to a 
small category (even though she itself may not be), and the tensor category C shall 
be said to be tensorially small if there is a tensor preserving equivalence between C 



and a small tensor category (see definitions 3.2.2 and 3.2.5). 

Consider the category Vecfc of finite dimensional vector spaces over a field k, which 
can be identified as Rep fc Go, the category of finite dimensional representations of the 
trivial group Gq. Denote further by VEC^ the category of all vector spaces over k 
(finite dimensional or not). We observe first that the category VEC^ should by no 
means be assumed to be small. Even her skeleton would consist of objects of every 
possible dimension over k, and hence of sets of every possible cardinality. If this 
skeleton were indeed a set, we could take the union of all objects contained in that 
set, and therefore arrive at a set of cardinality greater than that of any other set. 
This is anathema according to the basic tenets of set theory. 

But the category Vecfc is indeed small. To see this, we shall follow page 93 of 
[13] in observing that any category is equivalent (though not necessarily tensorially 
equivalent) to its skeleton. To realize the skeleton of Vecfc as a small category, for 
each n we take the set V n = k n , i.e. the collection of all formal linear combinations 
of k over the set {1,2, ... ,n} with the obvious /c-vector space structure. Likewise 
define Hom(V„, V m ) to be the set of all functions from V n to V m which qualify as k- 
linear maps under the given vector space structures. Then the V n and Hom(V„, V m ), 
themselves being a collection of sets indexed by the sets N and N 2 , can indeed be 
collected into a single set. Thus Vec^ is a small category. 

What is far less obvious is that Vec^ is tensorially small. To illustrate the problem, 
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we again direct the reader to page 164 of |14j . where the author shows that the skeleton 
of the category of sets cannot be given the structure of a tensor category (in this case 
defined as the usual cartesian product of sets). This is the reason, after all, that 
we bother with the assoc, comm, and unit isomorphisms in a tannakian category; 
demanding, for example, that (X <g> Y) <g> Z = X <g> (Y <g> Z) (strict equality) is simply 
too stringent. For similar reasons we do not believe that it suffices to endow the 
skeleton of Vecfc with the structure of a tensor category in the usual sense. 

Here is what we believe is a possible approach to remedying this. Denote by Cq 
the skeleton of Vec/c as defined above, and for each n e N, denote by C n the following 
category. The objects of C n are the objects of C n _i, along with all pairwise tensor 
products of objects in C n _i (via whatever standard construction one likes, e.g. as a 
certain quotient of the free vector space on V x W; see section 1.7 of [9]). For objects 
V,W E C n , we let Rom Cn (V,W) be Hom Cn _ 1 (V, W) if V and W are in C n _ 1; and if 
not, as the collection of all functions from V to W which qualify as fc-linear maps. 
Finally, we define C to be the union of the categories C n for n = 0,1,2, .. .. 

Note that, since Co already contains an isomorphic copy of every finite dimensional 
vector space over k, C contains no new objects up to isomorphism. The whole point 
of bothering with these new objects is so as not to encounter any paradoxes similar 
to that described on page 164 of [Hj. Proving rigorously that this category satasfies 
the axioms of a tannakian category would no doubt require significant effort, but we 
believe that it could be done. 

We would define the primitive relations of the language of abelian tensor categories 
on this structure in the obvious manner. For instance, the relation o if> = rj would 
hold precisely when r\ is the composition of and if) in the usual sense, and similarly 
for (f) + if> = rj. Importantly, we would define the relation X <g> Y = Z to hold when Z 
is the unique object of C n+ i such that X <g> Y = Z, where n is the least integer such 
that X and Y are both objects of C n . The relation assocx,y,z = 4> would hold when 
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is the unique map (X <g> Y) <g> Z — > X <8> (Y <g> Z) such that sends (a; (2) y) <g) z to 
x <8> (y <8> z), and similarly for comm^ y = 0. As for the unit relation, denote by 1 the 
unique 1-dimensional vector space in the skeleton Cq, and define unitx = to hold 
when is the unique map X — > 1 (g) X such that : x h> 1 <g> x. 

Given that one could verify that this category C satisfies the axioms of a tannakian 
category, showing it to be tensorially equivalent to the usual Vec^, and hence small, 
should be straightforward; simply define F : C — > Veck to be the inclusion functor. 
This functor is clearly full, faithful, and essentially surjective, hence an equivalence. 



Showing F to be tensor preserving (see definition 3.2.5) would likewise be straight- 
forward. Finally, apply definition 1.10 and proposition 1.11 of j5] to see that if F is 
an equivalence, and if it is a tensor functor, then it is also a tensor equivalence, in 
the sense that its inverse can also be taken to be tensor preserving. Thus F is an 
equivalence of abelian tensor categories, and Vec^ is a tensorially small category. 



7.1.1 A Quotient Category Approach 

Here we mention briefly a possible alternative to the ultraproduct approach taken 
in this dissertation, one which replaces the ultraproduct with a certain quotient cat- 
egory. 

Let Gi be a collection of afline group schemes over the fields fcj, and let Ci be 
the category Rep^.G,. Denote by Oie/ ^ P r °duct of the categories that is, 
the category whose objects are all possible tuples of objects (Xi : i G J), and whose 
morphisms are all possible tuples of morphisms (0j : i £ I), with the obvious defi- 
nitions of morphism composition, addition of morphisms, tensor product of objects, 
etc. Fix a non-principal ultrafilter U on J, and for objects (Xi), (Y{) G J^[ ie7 - Cj , define 
the following congruence relation ~ on Hom((Xj), (Yi)): (0j) ~ (ipi) if and only if 
the subset of / on which 0j = ipi is large. Note that, if (0j), (ipi) : (Xi) — > (Yj) with 
(0i) ~ (ipi), and if (p { ), (m) : (Yi) ->■ (Z t ) with (p t ) ~ (p^, then (faopi) ~ (V>»°^)> as 
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the intersection of two large sets is also large (see definition B.0.1 ). Thus ~ is indeed 
a congruence relation. 

Let C denote the quotient category of rLe/^« with respect to ~. Then under the 
assumption of the previous section, namely that any Rep k G is tensorially equivalent 
to a small category, we have 

Theorem 7.1.1. The quotient category C and the ultraproduct category Y\ u Ci are 
equivalent as abelian tensor categories. 

Proof. For each i, denote by C« the posited small category, and by Fi : Ci — > C\ the 
posited tensor equivalence (i.e., the inclusion functor), of the previous section. Then 
by proposition 1.11 of [5], let Gi be the tensor preserving 'inverse' of Fi. Define a 
functor G : C — > X\ U C% as follows: for objects, G((Xi)) = [Gi(Xi)}, and for mor- 
phisms, G([0j]) = [Gi((f>i)]. That G is essentially surjective is clear from the essential 
surjectivity of each Gi, and similarly the fullness and faithfulness of G follows. Thus 
G is an equivalence. 

By hypothesis each Gi is a tensor equivalence, and so comes equipped with a 
functorial isomorphism c x . Yi : Gi(Xi) (g) G(Yi) — =4 Gi(Xi <g) YJ) for all X, h Yi G Cj. 
Then define a functorial isomorphism c as follows. For each pair of objects (Xi), (1^) 
in the quotient category C, define C(x i ),(y i ) : [Ci(Xj)] <S> [Gi(Yi)} — > [Gi(Xi ® 3^)] as 
the composition 



c{x iU Yi) ■ [Gi{Xi)\ ® [GiiYi)} A [Gi(Xi) ® Gi(Yi)] ^ [G 4 (X, ® K t )] 



where $ is the natural injection defined in proposition 6.2.13 , and [c l x . Y ] is the 



ultraproduct of the linear maps c x . Y . (see equation 6.2.1). So equipped with c, we 



believe G can now be shown to be tensor preserving according to definition 3.2.5 



Apply again proposition 1.11 of [5 J to see that G is indeed a tensor equivalence. □ 

The chief disadvantage of ultraproducts, as highlighted, is that the constituent 
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categories of the ultraproduct must be shown to be small relational structures; the 
quotient category approach does away with this requirement. On the other hand, 
the chief advantage of the ultraproduct approach is that one immediately has Los' 
theorem, which allows us to pass immediately from first-order statements on the 
factors to first-order statements in the ultraproduct. In particular, we may conclude 
immediately that Y\ u Ci is a tannakian category, simply by virtue of the fact that 'being 
tannakian' is a first-order sentence in the language of abelian tensor categories. Of 
course, given the tensorial smallness of Rep k G, since the ultraproduct and quotient 
categories are tensorially equivalent, we conclude that a Los' theorem-type result must 
indeed hold for the quotient category as well; but this can no longer be assumed, and 
must be proven, and is the chief disadvantage of the quotient category approach. 



7.2 The Restricted Ultraproduct 

For the remainder of this dissertation, by a (abelian, tensor, etc.) category, we 
shall always mean a small category realized as a structure in the language of abelian 
tensor categories, and by a tannakian category, we shall mean a structure satisfying 
the axioms given in chapter |4| 

Theorem 7.2.1. Let d be a sequence of tannakian categories indexed by I , U an 
ultrafilter on I . Then Y\ u Ci ^ s a tannakian category. 

Proof. The property of being tannakian, by the work done in chapter |4j is express- 
ible by a first-order sentence in the language of these structures. By Los's theorem 



(corollary CO. 16), the same sentence is true in the ultraproduct. □ 



A word or two about what [] C; actually looks like. If [xj\ is an element of 
then [xi] is an object or a morphism of Ylu^i according to whether the set on which 
Xi G Ob or Xi G Mor is large. The axioms of a category state that exactly one of these 
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statements hold in every slot, and in an ultrafilter, exactly one of a subset of I or its 
complement is large. It thus does no harm to think of every element [xj\ of W u Ci as 
being represented by a tuple (xj) consisting either entirely of objects or entirely of 
morphisms, since it is necessarily equivalent to a tuple (many in fact) of one of these 
forms. 

If [(pi], [ipi], [rji] are morphisms of [] M Cj, then the relation [fa] o [ipi] = [rji] holds 
if and only if the relation fa o ipi = rji hold for almost every i. Similarly the tensor 
product of the objects [Xi] and [Yi] is [Zi], where Zi denotes the unique object such 
that Xi®Yi = Z h In short, [fa] o [ipi] = [fa o ^], and [X;] <g> [Yi] = [X t <g> Yi]. 

By Los's theorem, the same is true for anything that can be expressed as a first- 
order concept in our language. For example, since 'being internal Horn' is first-order 



(see page 72) we conclude immediately that an internal Horn object for [Xi] and [Yj] 
is necessarily an object [Zi], where almost every Zi is an internal Horn object for Xi 
and Yi. Importantly, an identity object for Y\ u Ci is a tuple [Ui] such that Ui is an 
identity object for almost every i, and an endomorphism of [Ui] is an element [fa] 
consisting of morphisms which point from Ui to itself almost everywhere. 

One should take care however not to be hasty in concluding that a given categorical 
concept is inherited by \\ u Ci from the Ci, if you do not know beforehand that the 
concept is first-order. The following example illustrates this. 

Take Ci = Veck for a fixed field k, indexed by I = N. For objects A and B 
of Vecfc, consider the (non first-order) categorical statement 11 A is isomorphic to an 
n-fold direct sum of B for some n". Now the following are both first-order: "X is 
isomorphic to Y" and for fixed n, U A is an n-fold direct sum of B" . This means we 
can identify [Xi] n with [X"], and for fixed n, objects [Yi] that are isomorphic to [X,j\ n 
with tuples of objects (Yj) which are almost everywhere isomorphic to (X"). So let 
Vi G Ci be an z-dimensional vector space, and W{ G Ci a 1-dimensional vector space. 
Then the statement U V is isomorphic to an n-fold direct sum of W for some n" is 
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true of every Vi and W{, but the statement is clearly not true of the elements [Vi\ and 
[Wi] inside the category n«^- This observation in fact proves that the statement U A 
is isomorphic to an n-fold direct sum of B for some n" is not first-order. 

In what follows we fix, for each i, an identity object for Ci, and denote it by l i . 
We denote simply by 1 the object [lj of Ylu^- 

Proposition 7.2.2. Let Ci be a sequence of tannakian categories, and denote by hi 
the field End(l^). Then End(V) can be identified with k = Ylu^i> ^ e ultraproduct of 
the fields fcj. 

Proof. As mentioned, End(l) consists exactly of those elements [fa] such that fa 
is almost everywhere an endomorphism of 1 { . But this is exactly the description of 



W u ki (see section 6.1 ), if we identify k% = End(lj). The multiplication and addition in 
End(l) and Y\ u ki are clearly compatible with this identification, since multiplication 
is composition of maps, and addition is addition of morphisms. □ 

Now assume that Ci is a sequence of neutral tannakian categories, and denote by 
bji the fibre functor on each d. As mentioned in the introduction, we see no way to 
endow W u Ci with a fibre functor, at least not one that is compatible with the each 
of the bJi (n w Ci m ight thus be an interesting example of a non-neutral tannakian 
category, but that is not investigated in this dissertation). Instead we look to a 
certain subcategory of n^*- 

Definition 7.2.1. For a sequence of neutral tannakian categories Ci, the restricted 
ultraproduct of the Ci, denoted rifl^5 i s the full subcategory of Yl u Ci consisting of 
those objects [Xj] such that dim(a;j(Xj)) is almost everywhere bounded. 

To avoid the use of a double subscript, the notation \\ R Ci makes no mention of 
the particular ultrafilter U being applied. As U is always assumed to be fixed but 
arbitrary, no confusion should result. 
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If [Xi\ has almost everywhere bounded dimension, then we may as well take it to 
have everywhere bounded dimension. And if [Xi] is everywhere bounded, Xi takes 
on only finitely many values for its dimension; by lemma [B. 0.10 there is exactly one 



dimension m such that the set on which dim(cjj(Xj)) = m is large. Thus, it does no 
harm to think of Y\ R Ci as the full subcategory consisting of those (equivalence classes 
of) tuples having constant dimension. 

Theorem 7.2.3. Yl R C% is a tannakian subcategory ofY[ u Ci- 



Proof. By lemma 5.0.1 it is enough to show that Yl R Ci is closed under the taking 
of biproducts, subobjects, quotients, tensor products, duals, and contains the (an) 
identity object. 

Each bji is fcj-linear, hence additive, and theorem 3.11 of [7] ensures that Ui carries 
direct sums into direct sums, and hence biproducts into biproducts. If [Xi], [Yi] 
have constant dimension, then certainly so do the vector spaces Ui(Xi) © Ui(Yi) ~ 
Ui(Xi © Yi). Thus Yl R Ci is closed under the taking of biproducts. 

As each Ui is exact, it certainly preserves injectivity of maps, i.e. subobjects. 
Then if [Xi] has bounded dimension and \Yj\ is a subobject of [Xi], likewise \Yj\ must 
have bounded dimension, since a vector space has larger dimension than any of its 
subobjects. A similar argument holds for quotients; thus Y\ R Ci is closed under the 
taking of quotients and subobjects. 

That Yln^i i s dosed under the taking of tensor products is evident from the 
definition of a tensor functor; if [Xi] and [Yi] have constant dimension m and n 
respectively, then Ui(Xi © Yi) ~ Ui(Xi) © Ui(Yi) has constant dimension mn. 

That Yl R Ci has an identity object is similarly proved; tensor functors by definition 
carry identity objects to identity objects, and the only identity objects in Vecfc are 
1-dimensional vector spaces. 

Finally, we must show that the dual of an object [Xi] of Yl R Ci also has constant 
dimension. But this is evident from proposition 1.9 of [5], which says that cj, carries 
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dual objects to dual objects, and the dual of any vector space has dimension equal to 
itself. 

□ 

Now define a functor uj from X\ R C% to Vec^ as follows. For an object [JQ] of n^i) 



we define a;([Xj]) = f Y[ u ^i(^i) (ultraproduct of vector spaces; see section 6.2), and 



for a morphism [</>J, we define w( ) = [^(0^)] (ultraproduct of linear maps; see 
page 88). 

guar- 



Since [Xj\ e Y\ R Ci 1S assumed to have bounded dimension, proposition 6.2.4 



antees that u carries [JQ] into a finite dimensional vector space (hence the reason we 
restrict to 11^^ * n the ^ TS ^ place). As the ultraproduct of maps preserves composi- 



tion, and since ln M V; — [1 : V% ~ * Vi] (proposition 6.2.5), u is evidently a functor 



Theorem 7.2.4. u is a fibre functor on Yl R ^i- 



Proof. We prove first that u is a tensor functor. For two objects [Xi], [Yj] of 



we define the requisite natural isomorphism crxjjjYJ of definition 3.2.5 to be the com- 
position 



rM*i) ® IluiQQ A u.Ui{Xi) <g> Ui{Yi) [cx " Yi] > u.Ui{Xi ® y; 



where cx t ,Yi denotes the given requisite isomorphism in each individual category, and 



$ is the natural isomorphism defined in proposition 6.2.13 We need to verify that 
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the three conditions of definition 13.2.51 are satisfied. Condition 1. translates to 



u([Xi])®u(\Yi)® [Zi]) 




(w([X 4 ])®u;([F i ]))8)a;([Z 4 ]) 



u([Xi] ® ([yj ® [Zi])) 



w(assoc) 



w(([X<] <g> [X-]) <g> [Z<]) 



where assoc' denotes the usual associativity isomorphism in Vec^, and we have dropped 
the obvious subscripts on c. The expanded form of this diagram is 



[li]®[cy <lZi ] 



A 

Y\ui(Xi) ® ( em**) ® fm^ 



/ 

[wi(associ)] 

em(x< ® y) ® z^ 

l- C X i ®Y i ,Z l ] 



U"i( x i® Y i) ® em^) 
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Now consider the diagram 



u 

A 



FM-Xi) ® n^(^) 




Hi]®* 



assoc'o($®[l i ]) 



([cx i ,y i ]®[li])°* 



[cjfj.K® 1 !] 



A 



[wi(associ)] 



A 




where assoc^ is the associativity isomorphism in the category Vec*^. This diagram 
has four simple subpolygons. The middle polygon consisting of seven vertices is 
a contracted version of the previous diagram, and is what we are trying to prove 
commutes. Commutativity of the top and bottom triangles follow directly from the 



naturality of the isomorphism $ (see diagram 6.2.2 on page 94), and the left-most 



polygon can be verified directly by hand. And since all of the maps are isomorphisms, 
some diagram chasing shows that if the outermost six vertex polygon can be shown to 
commute, so also does the simple seven vertex polygon. But the outermost polygon 
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IS 



U.u>i(Xi) ® Ui(Y t ® Zi) 



[li'SlCY i ,Z i ] 



[^(associ)] 



]\uj i {{X i ®Y i )®Z i 




]\u i (X i ®Y i ) ®Ui{Zi 



and by theorem 6.2.9 commutativity of this diagram is equivalent to the almost 
everywhere commutativity of 

Ui(Xi) ® Ui(Y t ® Z^ 



CX i ,Y l ®Z l 




^(Xi ® (y< ® ^)) 



Wi(associ) 



^((Xj (8) Fi) <g> 2i 




But this is commutative everywhere, as it is merely condition 1. of definition |3.2.5 
by virtue of each uji being a tensor functor, u thus satisfies condition 1. 
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Condition 2. is proved similarly; consider the diagram 




[aji(commi)] 




The outermost hexagon is our expanded version of condition 2., and is what we must 
prove. Commutativity of the left trapezoid can be verified directly by hand. And 



again by theorem |6.2.9[ commutativity of the right trapezoid is equivalent to the 
almost everywhere commutativity of 



Ui(Xi) <g> Ui(Xi) ^ Ui(Xi ® Yi) 

Ui(commj) 

But this is condition 2. applied to each individual Ui, which commutes by assumption. 

For the purposes of this proof we shall replace condition 3. of definition |3.2.5| 
with the seemingly weaker but equivalent condition given in definition 1.8 of [3]: 
that whenever [Ui] is an identity object of \\ R Ci and [u^] : [U,j\ — > [U] <8> [Ui] an 
isomorphism, then so is a; ([£/»]) and cuQ-Uj]). Since any two identity objects of a tensor 
category are naturally isomorphic via a unique isomorphism commuting with the 
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unit maps (proposition 1.3 of [3]), we need only verify this for a single identity object, 
namely the pair [lj] and [unit^ij. As each cjj is a tensor functor, it sends lj to an 
identity object in Vec^, and we know of course that the only identity objects of Vec^ 



are 1-dimensional. Thus = riw^lj) * s 1-dimensional (proposition 6.2.4), thus 

o;([lj]) is an identity object of Vecfc. And again, as each cjj is a tensor functor, it 
sends uniti,! to an isomorphism Wj(lj) — > Wj(Ij) ®Wj(lj), whence uj sends [unit^ij to 
an isomorphism as well. 



uj is /c-linear by the fcj-linearity of each uji and proposition 6.2.5 



w([ai][0i] + [fa]) = w([ai<j>i + fa]) 
= [ui(ai<f>i + fa)] 
= [aiUi(<f>i) + uJi(fa)] 

= [a^UJ^fa)] + [uJi(fa)] 

= [ ai ]uj([fa]) + uj([fa]) 

uj is faithful: if [fa] and [ipi] are different morphisms, then (fa) and (fa) differ on a 
large set. By faithfulness of each uji, so do (uji(fa)) and (uji(fa)), and by proposition 



6.2.5, [oji(fa)] and [uji(fa)] are different linear maps. 



uj is exact by the exactness of each uj{, proposition 6.2.12 , and the fact that "is an 
exact sequence" is a first-order concept. The sequence 



[o] -> [xj J*I> %] [Zi\ [o] 



in Y\ R Ci is exact if and only if the constituent sequences 



-> X A K A Z, 



are almost everywhere exact, in which case uji of these sequences is almost everywhere 
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exact, in which case u of the first sequence is exact. This completes the proof. 



□ 



Corollary 7.2.5. If Gi is a sequence of affine group schemes defined over the fields 
hi, then \\_ R Rep ki Gi is (tensorially equivalent to) Rep Uuk .G for some affine group 
scheme G. 



Proof. By theorems 7.2.3 and 7.2.4, r} fl Rep fc .Gj is a neutral tannakian category over 
the field Apply theorem 3.3.2 □ 
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Chapter 8 



Finite Dimensional Subcoalgebras 
of Hopf Algebras 



In this chapter we take a break entirely from working with ultraproducts; no 
understanding of them is required here whatsoever. The main theorem of this chapter 
is perhaps of interest in its own right, but for our purposes mostly serves as an 
invaluable lemma with which to prove the main theorem of the next chapter. 

Here we investigate the special case of when a finite dimensional comodule C over 
a Hopf algebra (A,A,e) over a field k is actually a sub-coalgebra of A; our intent is 
to show that these satisfy some very nice regularity properties in terms of how they 
sit inside the category ComodA- To say that C C A is a subcoalgebra is to simply 
say that the image of the map A : A — > A ® A, when restricted to C, is contained 
inside C <8> C C A £g> A, and that we are regarding the map A as the (left or right, 
depending) A-comodule structure for C . Throughout we will use the same symbols 
A and e for their restrictions to C. 

In a sense though, the case of a finite dimensional A-comodule being a subcoalge- 
bra of A is really not that special. The fundamental theorem of coalgebras (theorem 



2.2.3) states that any coalgebra (and hence Hopf algebra) is a directed union of finite 
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dimensional subcoalgebras. Further, theorem 2.2.2 states that any A-comodule can 
be embedded in some n-fold direct sum of the regular representation. Thus every 
finite dimensional A-comodule can be embedded in C n , where C is some finite di- 
mensional subcoalgebra of A. We see then that the entire category Comodyi can be 
realized as a direct limit of the principal subcategories (C), where C ranges over all 
finite dimensional subcoalgebras of A. Anything categorical we can say in general 
about these subcoalgebras of A must surely then (and will) be of value. 

We would also like to mention that, so far as we can tell, these results are valid 
for any coalgebra A, Hopf algebra or not. Nonetheless, as all of our applications of 
these results will be toward Hopf algebras, we leave them as stated. 

Let C be a subcoalgebra of (A, A, e). Since A restricts to C <S> C on C, we can 
think of C as both a left and a right comodule over A. That is 

A:C ^C®C C C®A 

gives a right A-comodule structure for C, and 

A:C ^C®C CA®C 

gives a left A-comodule structure for C. Unless C is co- commutative we can expect 
these structures in general to be quite different. 

For the remainder of this chapter denote by Cr the category of finite dimensional 
right A-comodules, and denote by ujr the fibre (i.e. forgetful) functor Cr — > Vec^. 
Define similarly Cl and ojl. For a finite dimensional subcoalgebra C of A denote by 
Endc R (C) the algebra of all endomorphisms on C, where we consider C as an object 
in the category Cr, as defined above; make a similar definition for Endc L (C). Denote 
as usual by End(o;^| (C)) the collection of all natural transformations of the fibre 



functor lor restricted to the principal subcategory (C) (see definition 3.3.3), similarly 
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for End(u; L |(C)) 

The remainder of this chapter is devoted to proving, piecemeal, the following: 

Theorem 8.0.6. Let C be a finite dimensional subcoalgebra of the Hopf algebra A 
over the field k. Then 

1. Endc R (C) = End(ojL,\(C)) = the centralizer of Endc L (C), and Endc L {C) = 
End(u)n\(C)) = the centralizer of Endc R (C) 

2. All of the above are canonically isomorphic to the dual algebra of the coalgebra 
C. 

It is clear from the remarks on page 135 of [5j, combined with lemma 2.13 of the 
same text, that the author is quite aware that the algebra C* is isomorphic to both 
End(coji\(C)) and End(o>£|(C)). This is not surprising; we shall argue at the end of 



this chapter that this theorem in fact proves that the 'algorithm' given in section 3.3 



for recovering the Hopf algebra A from the category Comod^ does in fact give the 



correct answer. As to the other assertions of theorem |8.0.6[ we are unable to locate 
any specific occurrence of them in the literature. 

In the statement of the theorem we have deliberately confused (as we may, by the 



discussion on page 43) End(oj r\(C}) with its image inside Endv e c fe (wi?(C))- Note that 
these are equalities given in 1. above, not just isomorphisms. 

We will prove first that C*, the dual algebra to the coalgebra C, is isomorphic to 
Endc i j(C). We define maps 



C* AEnd Cfl (C) 
C* A End CR (C) 



as follows. For a G C* , Q(a) is the composition 



C ^C®C k®C ~C 
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and for G End Cfl (C), T(0) is the composition 



C 



c 



Theorem 8.0.7. The maps Q and T are well-defined algebra maps, and are left and 
right-sided inverses for one another, making them both isomorphisms of algebras. 

Proof. We need to prove first the non-obvious fact that, for any a G C*, il(a) is an 
endomorphism on C as a right A-comodule, that is, that the diagram 



c — n(a) > C 



c®c -—^C®C 



commutes. Consider the diagram 



c — ^c®c am > k®C 



1®A 



1®A 



c 



C ®C —^C ®C ®C >k®C®C — >c®c 

A®1 a®l®l ~ 



The outermost rectangle is an expanded version of the previous diagram, and is what 
we are trying to prove commutes. Commutativity of the right-most simple rectangle 
follows directly from the naturality of ~, commutativity of the middle rectangle is 
obvious, and the left-most rectangle is a coalgebra identity. Thus the outermost 
rectangle commutes, and Q(a) is indeed an endomorphism of C as a right A-comodule. 
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We argue now that ttoT and fofi are both the identity. Let a G C*, and consider 




C®k > k®k 



The top line is the map r(f2(a)). We would like to see that this is equal to a, 
and a is clearly equal to the bottom three-map composition; thus we seek to prove 
commutativity of the outermost polygon. Commutativity of the right-most simple 
polygon follows again from the naturality of ~, commutativity of the middle square 
is obvious, and the left-most triangle is again a coalgebra identity; thus r(fi(a)) = a. 
Now let <f) e End Cfl (C). Consider 

C ^ C ®C^lk®C^^C 




C ^C®C 



The top line is the map Q(T ((}>)), which we would like to see is equal to <fi. Commu- 
tativity of the left-most square is the assertion that (f> is an endomorphism of C as 
a right A-comodule, and commutativity of the middle triangle is obvious. Thus the 
outermost polygon commutes, giving us 

n(T(4>)) = 0o (A o ( £ ® l)o ~) 

But (A o (e <g) l)o ~) = 1 is coalgebra identity, and hence the right hand side is equal 
to 0, proving the claim. 

We must finally prove that T is a A;- algebra map. Recall the multiplication on C*; 
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it sends the pair of functionals a, /3 : C — >■ k to the functional 



Let <f),ip G Endi?(C), and consider the diagram 




4> ip 



The composition that starts at the bottom left hand corner, goes up, and then all 
the way across, is an expanded version of the map T(0) * r(^), where * denotes 
the multiplication in the algebra C*. The one that starts at the bottom left hand 
corner, goes across, and then diagonally up, is the map r(0 o ■0); we want to see of 
course that these are equal. It is enough to show then that all of the simple polygons 
commute. Starting from the left: commutativity of the first is the assertion that 
is an endomorphism of C as a right A-comodule, the second is a coalgebra identity, 
and commutativity of the third and fourth follow directly from the naturality of ~. 
Therefore T is a multiplicative map, and is obviously /c-linear, since composition with 
e (or any linear map) is so. Therefore T is an isomorphism of /c-algebras. The same 
is true of fl, since it is the inverse of such a map. □ 

We claim also that C* is in much the same way isomorphic to End^C), the 
endomorphism algebra of C as a left A-comodule. This time we define a map 

C* A End Ci (C) 
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as, for a G C*, 0(a) is the composition 



C ^C®C ^C®k~ C 



(notice the switching of the slots on which 1 and a act). We define a map A : 
End Ci (C) — > C* the same way as before: for an endomorphism of C in the category 
Cl, A(0) is the composition 

C C k 

An proof almost identical to that of the previous theorem shows again that 6 and A 
are isomorphisms of /c-algebras, which we will not repeat. 

Lemma 8.0.8. Let V be a finite dimensional vector space over a field k, and let 
r, s G V ®V such that r ^ s. Then there exists a linear functional a : V — > k such 
that the composition 

V <g> V k <g> V ~ V 
sends r and s to different things. 

Proof. Let e±, . . . , e n be a basis for V. Then we can write 



If r ^ s, then Cy 7^ for some i and j. Then let a be the functional which sends e* 
to 1 and all other to 0. Then the composition above sends r to 

Cj,iei + Cj )2 e 2 + . . . + Cjj-ej + . . . + c 
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while it sends s to 



dLiei + dioe 2 + ■■■ + due* + . . . + di 



and these are clearly not equal, since 7^ dij. 



□ 



Theorem 8.0.9. Let C be a finite dimensional subcoalgebra of the Hopf algebra A. 
Then Endc L {C) = the centralizer of Endc R (C). 

Proof. Let G End Ci (C) and tp G End Cfl (C), and consider the diagram 



C 



C 



C 



C 



c®c 



c®c 



c®c 



c®c 



Commutativity of all three of the simple squares are merely the assertions that <p and 
■0 are morphisms in the categories Cl and Cr respectively; thus this entire diagram 
commutes. If we look at the rectangle consisting of the two left squares we obtain 
0o-0oA = A o (ip <g) 0), and looking at the rectangle consisting of the two right 
squares we obtain -0o0oA = Ao(-0(g)0). Thus we have 

0O-0oA="0O0oA 



But A is injective, and hence o ip = ip o 0. This shows that Endc L (C) is contained 
in the centralizer of Endc K (C). 

Now let be any linear map C — ?■ C, and suppose that it is not a member of 
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Endc L (C), which is to say that the diagram 



C 



c 



c® c 



l®4> 



c®c 



does not commute; we claim there exists a member of Endc fl (C) with which <fr does 



not commute. Recall from theorem |8.0.7| that for any linear functional a : C — > k, 
the composition 

c A C ® C k <g> c ~ c 

belongs to Endc R (C); our job is then to find an a so that <p does not commute with 
this map. Consider 



C > C ®C > k®C 



c 



c 



c®c 



k®C 



c 



All of the simple squares of this diagram commute, except for the left most one, 
which does not by assumption. We want to show that there is an a such that the 
outermost rectangle does not commute. Pick v G C such that commutativity of the 
left square fails, let r G C ® C be its image when chasing it one way, and s G C <g> C 
its image when chasing it the other way. By the previous lemma, pick a such that 
the composition 



C®C >k®C 



c 



sends r and s to different things, let's say m ^ I. Then if we chase v around one path 
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of the outermost rectangle we arrive at m, and the other way, we arrive at I; thus 
the outermost rectangle does not commute. This gives a member of Endc H (C) with 
which cf> does not commute, and the theorem is proved. □ 

An identical proof shows that End Ci{ (C) is the centralizer of End CL (C), which we 
do not repeat. 

Our last task is to show that End CL (C) is equal to End(cu R \(C)). Any member 
: C — > C of the latter must at the least make diagrams of the form 

C 



V 




c 


> c 



commute, where ip is an arbitrary element of Endc R (C). As Endc i (C) is equal to 
the commutator of End Cfl (C), we already have the forward inclusion End(u; R \{C)) C 
Endc L (C); it remains to show the reverse. 

Lemma 8.0.10. Let (V, p) , (W, [/,) be finite dimensional comodules over the Hopf 
algebra A. Fix bases e±, . . . , e n , fi, ■ ■ ■ , f m for V and W respectively, and write 

p : ej i-> e { ® 

i 

V '■ fj *-> XI f* ® bi i 

i 

1. If V is a subobject ofW, then each aij is a linear combination of the bij. 

2. If W is a quotient object ofV, then each is a linear combination of the a^. 

Proof. This is a simple fact from linear algebra if we think of (a^) and (bij) as 
matrices. If : V — > W is a linear map, write it as the matrix (c^-) in the relevant 
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bases. Then <fi being a morphism of A-comodules is equivalent to the matrix equality 

In case (c^) is injective it has a right-sided inverse, given say by the matrix 
Then (a^) = (dij)(bij)(cij), and clearly every entry of the right hand side is a linear 
combination of the fe^-; this proves 1. If (c y ) is surjective it has a left-sided inverse, 
again call it ((%). Then we have (cy)(ay)(dy) = (6^). This proves 2. 

□ 

Lemma 8.0.11. // (V, p : V — > V <E> -A) «s a hg/ii A-comodule, and if it belongs to the 
principal subcategory (C), then the image of p is contained inV ®C . 

Proof. The claim is obvious in case V is direct sum of copies of C, since then its 
comodule map is the composition 

A™ : C n (C <g) C) n ~C n ®C cC n ®A 

Suppose then (X, p) is a quotient of some C n . Then choose (a^) for X and (6^) 
for C n as in the previous lemma. Each 6^ is in C by assumption, and thus so is each 
aij, being a linear combination of the by. A similar argument holds if we consider a 
subobject of the quotient object (X, p). 

□ 

Thus, for any object (X, p) G (C), we can write p : X — > X ® C instead of 
p : X -> X <g> A 

Let : C — >■ C be any linear map. Then for n € N we define 

. C n^ C n 

as n , that is, the unique linear map commuting with all of the canonical injections 
C C n . Now let (X, p x ) be any object of (C). Then there is an object (Y, py) of 
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(C) and a commutative diagram 



Y <^ 



Px 



PY 



X <g> C c > y ® c « — C n <8> C 

t<S)l 7r®l 



Now, if defines an element of End(cuR|(C)), there exist unique linear maps (px and 
0y making 




commute. But, unless we know a priori that (p defines an element of End(a;jj|(C)), all 
we can say is that (fry exists, but may not be unique, and that <px is unique, but may 
not exist. Further, if we choose another such n, (Y,p Y ), n and i, one cannot expect 
to obtain the same linear map (px, again unless we know that <fi G End(tUR|(C)). For 
the moment then, we make the following deliberately ambiguous definition. 

Definition 8.0.2. Let X be an object of (C), : C — > C any linear map. Then 
we define (px '■ X — > X to be any linear map satisfying any one of the following 
conditions. 



1. If X = C n , then (p x = (p n - 

2. There exists an n and a surjective map C n — — >■ X such that n o (p x = (p n o it. 

3. There exists a quotient object Y of C n such that (p Y exists and satisfies condition 
2. above, and there is an injective map X — > Y such that (px ° t = t o <f> Y . 
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So, when we prove theorems about "the" map (f>x, it is understood to apply to 
any (fi x satisfying any one of the above conditions, and under the assumption that it 
exists in the first place. It will only later be a consequence of these theorems that (fix 
is well-defined; that it always exists, and is always unique. 

Lemma 8.0.12. Let (X,p x ) be an object of (C), (fi an element of End CL (C). Then 
the following is always commutative: 



X 



<i>x 



X 



PX 



PX 



x®c 



x®c 



Proof The claim is obvious if X = C, since then the above diagram is the definition 
of (fi being an endomorphism of C as a left A-comodule. If X = C n , consider 



C n 



(c <g) cy 



c n 



(c ® cy 



c n ®c -> c n ® c 



Commutativity of each of the squares is obvious, and commutativity of the outermost 
rectangle is what is desired. 

Now suppose C n — ^ X is a quotient of C n and that (fix satisfies condition 2. of 
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definition 18.0.21 Consider 



7T®1 




100 



We seek to prove commutativity of the right-most square. Commutativity of the 
left most square has been proved, the middle square commutes by definition, the 
top and bottom rectangles commute because 7r is a map of right comodules, and 
commutativity of the outermost polygon is obvious. If one starts at the second 
occurrence of C n at the top and does some diagram chasing, he eventually ob- 
tains n o (p x o (1 <g> 0)) = TT o (0 X o p x ). But TT is surjective, and thus we have 
px ° (1 <8> 0) = (px ° Px, and the claim is proved. 

Now suppose (X, px) is a subobject of the quotient object Y via the map X — > Y, 



so that (px satisfies condition 3. of definition 8.0.2 and that 0y satisfies condition 2. 
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Consider 



«g>i 



Y ®C 



PY 



Y 



Y®C< 



Y 



X ^X(g)C 



-=>X >x®c 



Px 



«g>l 

Commutativity of the right most square is again what we seek to prove. We have 
proved commutativity of the left square, the middle commutes by definition, the top 
and bottom rectangles commute since t is a map of comodules, and commutativity 
of the outermost rectangle is obvious. Starting at X on the top line, some diagram 
chasing shows that px ° (1 <8> 0) ° (t <E> 1) = <fix ° Px ° <S> 1)- But Kg) 1 is injective, 
whence we have px ° (1 <8> 0) = (px ° Px- The lemma is proved. □ 

Proposition 8.0.13. // (X,p x ), (Y, p Y ) are any objects of (C), ip : X — > Y any 
morphism in Cr, and an element of Endc L (C), then the following commutes: 
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Proof. Consider 



X 



4>x 



X 



Y 



Y 



I'X 



PX 



PY 



PY 



X®C 



X®C 



Y®C 



Y®C 



Commutativity of the right and left squares follow from lemma [8.0. 12| and the middle 
square commutes because ip is a morphism in Cr. Thus this entire diagram commutes. 
Looking at the left two-square rectangle we have 



(px ° tp ° Py = Px o {i> 



and at the right we have 



ijj o y o p Y = px ° (ip <8> 0) 



Thus (0x of)oj9y = (f® 0y) o p y . But py is injective (as all comodule maps are), 
hence 0x ° ip — o 0y, and the proposition is proved. 

□ 

All that is left to show is that the map (px, for (p G Endc L (C) and X G (C), 
actually exists and is unique. 

Uniqueness is immediate: if (px and (p' x are any two maps satisfying definition 



8.0.2, then they both satisfy the hypotheses of the previous proposition. As the 
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identity map 1 : X — > X is a morphism in Cr, the diagram 




commutes, showing (fix and (fi' x to be equal. 

For existence, suppose first that X is a quotient of C n under tt. Then (fix certainly 
exists; if e« is a basis for X, pull each back through ir~ l , down through (fi n , and 
back through 7r. But we know that (fix is unique, and the only reason that it would 
be unique is because (fi n stabilizes the kernel of 7r; otherwise there would be many (fix 



satisfying condition 2. of definition 8.0.2 This observation applies to any surjective 
map on C n . As every subobject of C n is necessarily the kernel of some surjective map 
we have proved 

Proposition 8.0.14. If (fi £ Endc L (C), (fi n stabilizes all subobjects of C n . 

Now let Y be a subobject of the quotient object X, with X — > X/Y the canonical 
projection. Then there exists a map (fix/Y making 



X 



4>x 



X 



X/Y 

<t>X/Y 

X/Y 
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commute. But this 4>x/y also makes 




commute, in particular the outermost rectangle. Thus <fix/Y satisfies condition 2. of 



definition 8.0.2, and is hence unique; but once again, the only reason this would be 
true is if (fix stabilized the kernel of tt, namely Y . We have proved 



Proposition 8.0.15. If X is a quotient object of C" and <fix satisfies condition 2. of 



definition 8.0.2, then (fix stabilizes all subobjects of X . 



Finally, if Y is a subobject of the quotient object X, then (fix stabilizes Y, whence 
there is a map 0y making 



Y c 



X 



4> Y 



4>x 



Y <=_ 



X 



In all cases then (fix exists and is unique. We have proved 



Theorem 8.0.16. For a subcoalgebra C of A, Endc L (C) = End(u R \(C)). 



8.1 Corollaries 

Here we record some results based on the above which will be used later. We 
will prove the results for the category Cr, we leave it to the reader to formulate the 
obvious analogues for Cl. 
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Proposition 8.1.1. Let C and D be finite dimensional subcoalgebras of the Hopf 
algebra A with C C D. Then the transition mapping End(u r\(D)) — > End(u r\(C)) is 
dual to the inclusion map C — )■ D via the isomorphism End(un\{C)) = Endc L (C) ~ 
C* . In particular, this transition mapping is surjective. 



Proof. Let C — > D be the inclusion mapping. Recall from page 120 that we have 
an isomorphism A : End^^l (D)) — > D* given by, for <p : D — > D, A(0) is the 
composition 



and that A has an inverse G which sends a linear functional a to 



D ^ D®D ^ D®k~D 



As i is a map of coalgebras l* of algebras, and we have a commutative diagram 



D* 

A 



c* 



e 



EndKJ( J D))^>End(u; fi |(C)) 



where T is some as of yet unidentified algebra map. Note that since i is injective, i* 
is surjective, and hence so is T. We claim that T is the usual transition mapping. 
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Let G End(uu R \(D)), and consider 




Commutativity of the top left most simple polygon is the assertion that i is a map 
of coalgebras, and commutativity of all the other simple polygons in the top row is 
trivial. The first polygon in the bottom row is the assertion that is an endomorphism 
on D as a left comodule (which it is, by the equality End(o;ij| (D)) = Endc L (-D)) and 
the last polygon is a coalgebra identity; thus this entire diagram commutes. Now the 
composition comprising the entire top line is exactly the map T(0) = f 6(t*(A(0))), 
so the outermost polygon of this diagram is 



C 



T{4>) 



c 



D 



V 

D 



But, the image of under the transition mapping End(cj fi | (D)) — > End(u!R\(C)) is 
by definition the unique linear map <pc '■ C — > C that makes this diagram commute. 
Hence (fic —T((f>), and the proposition is proved. □ 

Theorem 8.1.2. Let D be a finite dimensional subcoalgebra of A, End(to r\(D)) — L 
any surjective mapping of algebras. Then there exists a subcoalgebra C of D such that 
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L is isomorphic to End(u> r\(C)) and 



End(u R \{D)) 



-> L 



End(u R \(C}) 



commutes, where T is the transition mapping. 



Proof. As End(u>R\(D)) — > D* is an isomorphism of algebras, any quotient of the 
former gives rise to a quotient of the latter via 



D 



C* 



End(uj R \(D)) 



L 



Here we have skipped a step and denoted this algebra by C*, since, being finite 
dimensional, it is necessarily the dual algebra to a unique coalgebra C. Note that C 
is a subcoalgebra of D via the map (ir')* and the natural isomorphism C*° ~ C. This 
means that we can take n' to be dual to the inclusion mapping C — H> D; let us then 
replace ir' with l*. 

We need to verify that L can be identified with End(cu R \(C)). Consider 



D 



End(u R \(D)) 




EndK|(C7» 
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Commutativity of the outermost polygon was proved in proposition 8.1.1, and com- 
mutativity of the square is given. Define the map £ to pass back up through the 
isomorphism with C* and then down through O, i.e. £ =~ _1 o@. This S is the 
isomorphism we seek; the theorem is proved. □ 

There is an obvious analogue to this theorem as regards subalgebras of End(a;^| (D)) 
as opposed to quotients, which we will state but not prove. 

Theorem 8.1.3. Let D be a finite dimensional subcoalgebra of A, and L End(uji\(D) 
any injective mapping of algebras. Then there exists a quotient coalgebra C of D such 
that L is isomorphic to End(uji\(C)) and 

—> End(ujR\(D)) 



End(u R \(C)) 
commutes, where T is the transition mapping. 

As promised at the beginning of this chapter, these theorems actually prove 



that the 'algorithm' described in section 3.3 for recovering the Hopf algebra A from 



Comodyi does in fact give the correct answer. (This does not prove the general prin- 
ciple of tannakian duality; here we are assuming from the outset that the category 
we are looking at is ComodA for some Hopf algebra A). 

Theorem 8.1.4. Let C = ComodA for some Hopf algebra A. Then A can be recovered 
as the direct limit of the finite dimensional coalgebras End(u\(X))° , with the direct 
system being the maps End(uj\{X))° — End(u\(Y))° whenever X G (Y), where 
End(u\(Y)) End(u\(X)) is the transition mapping. 

Proof. As argued at the beginning of this chapter, the entire category Comod^ can be 
recovered as the direct limit of the principal subcategories (C), where C ranges over 
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all finite dimensional subcoalgebras of A, with the direct system being the inclusions 



C C D. Lemma 3.4.2 tells that, for purposes of computing the direct limit, we are 
justified in disregarding all objects but these subcoalgebras and all maps but these 



inclusions. But theorem 8.0.6 tells us that End(co>| (C))° ~ C, and proposition 8.1.1 
tells us that under this isomorphism, the map End(u;|(C)) End(u;|(L>)) can be 
identified with the inclusion map C C D. Apply theorem |2. 2. 3| to see that the direct 
limit of these is exactly the Hopf algebra A. □ 
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Chapter 9 

The Representing Hopf Algebra of 
a Restricted Ultraproduct 

Let ki be an indexed collection of fields, (A4, Aj,£j) a collection of Hopf algebras 
over those fields, and Ci the category Comedy • The work done in chapter [7] tells us 
that the restricted ultraproduct Y\ R Ci is itself a neutral tannakian category over the 
field k = n w ^5 hence tensorially equivalent to Comodyi^ where A^ is some Hopf 
algebra over k. The question then: what is A^l 

Before starting in earnest, let us examine an obvious first guess: is the ul- 
traproduct of the Hopf algebras A^. But an ultraproduct of Hopf algebras is not, in 
general, a Hopf algebra. An ultraproduct of algebras over the fields ki is indeed an 
algebra over the field fcj, with the obvious definitions of addition, multiplication, and 
scalar multiplication. The problem comes when we try to give it the structure of a 
coalgebra, consistent with the coalgebra structures on each A4. We start by writing 

A : Y\Ai -^4 YlAi ® Ai 

u u 

But as it stands, this does not suffice; we need A to point to flw^* ® EL^*- R- eca U 
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from proposition 6.2.13 that there is a natural injective map 




and that, unless the A{ have boundedly finite dimensionality, it is not surjective. The 
image of $ consists exactly of those elements [vi] which have bounded tensor length, 
and for a given collection of non-boundedly finite dimension, it is a relatively 
simple matter to come up with an element [a^] G n«^« such that [Aj(aj)] has un- 
bounded tensor length. Thus we cannot expect the image of A constructed above to 
be contained in riw^ ® EL^* in general. 

The next section is devoted to identifying a certain subset of which can 

indeed be given the structure of a coalgebra, using the definition of A given above. 
Thereafter we will show that this coalgebra is in fact a Hopf algebra, indeed equal to 
the Aoo we seek. 

9.1 The Restricted Ultraproduct of Hopf Algebras 

To allay some of the suspense, we give the following definition, whose meaning 
will not be clear until later in this section. 

Definition 9.1.1. The restricted ultraproduct of the Hopf algebras A iy denoted 
Ar, is the subset of n«^« consisting of those elements [a«] such that rank(aj) is 
bounded. 

Our goal in this section is to define exactly what "rank" means in this context, 
and to show that Ar can be given the structure of a coalgebra. Note that the notation 
A R makes no mention of the particular ultrafilter U being applied. As U is always 
understood to be fixed but arbitrary, no confusion should result. 

Let A be a Hopf algebra, C = Comodyi. For each X G C let Lx be the image 
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43h , and L x L Y the usual transition 



of End (u I (X)) inside End(co>(X)) (see page 
mapping for X G (Y). Then we have an inverse system of algebras, and from it 
obtain an inverse limit 

hm L x 

c 




Ty,Z 




rp 

Lemma 9.1.1. If C is a subcoalgebra of A, then limLx — ^> Lq is surjective. 



Proof. By the discussion on page 114 the entire category C is generated by the prin- 
cipal subcategories (C), where C ranges over all finite dimensional subcoalgebras of 
A, with the direct system being inclusion mappings C C D when applicable. We can 



then apply lemma 3.4.2 to see that we might as well have recovered l^mLx with re- 
spect to the sub-inverse system consisting of all subcoalgebras of A under the inclusion 
mappings, that is as 



limLx 

c 




Tc,D 




where C and D range over all subcoalgebras of A, and Tc,d defined when C C D. 



Proposition 8.1.1 tells us that Tc,d is always surjective, and it is a standard fact about 



inverse limits that if this is the case, Tc is always surjective. 



□ 



Now apply the finite dual operation to the above inverse system diagram to obtain 
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a diagram of coalgebras: 



c 





Y.Z 



44 



that this 



The maps Ly — L°z therefore form a direct system; recall from page 
is exactly the direct system from which A can be recovered as its direct limit. Let us 
rename T Y z as <Py,z, and so we have the direct limit diagram 



limL^ 





with A = fini^ L° x . By the universal property of direct limits there is a unique map 
of coalgebras, call it <f> : A — > (|imLx)°, making the following diagram commute for 
all Y e (Z): 




(9.1.1) 



Proposition 9.1.2. The map is injective. 

Proof. Recall the concrete definition of a direct limit of algebraic objects; its un- 
derlying set consists of equivalence classes [a] where a is some element of some L Y , 
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with a G Ly, b G Lz equivalent when there is some with Y, Z G (T) such that 
^y,r( a ) = 4>z,t(o)- As is discussed on page 
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for any object Y, there is a sub- 



coalgebra C of A such that K G (C); this shows that any element a of any Ly is 
equivalent to some element c of L° c for some subcoalgebra C oi A. Thus, every ele- 
ment of A = lux^L° x can be written as [c], for some c G L^, C a coalgebra. Further, 
given elements [c] and [d] of liniL^, we can clearly choose subcoalgebras C and D 
of A so that c G Lq, d G L^,, and C d D (just enlarge Z) to be a finite dimensional 
coalgebra containing both C and D). 

So let [c], [d] G liniL^, with c G L^, d G LJ,, and C C D, and suppose that 
maps [c] and [d] to the same element. Consider the diagram 




4>C,D 



To say that <f>([c]) = <j>([d\) is the same as saying that T^(c) = Tp(d). But To is 



surjective by lemma 9.1.1 thus T D is injective. By commutativity of 




we see that we must have Tq d (c) = d; this means that [c] = [d]. 



□ 



The map is by no means generally surjective; the author verified this with a 
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counterexample which he will not burden you with. 

We pause for a moment to see what the map actually looks like. A typical 
element of A = lm^L° x is an equivalence class [a : L Y —> k], where Y is some object 
of C. Passing this element through is the same as pulling it back through <f> Y , and 
passing back up through T Y . Thus <f>([a :Y—>k]) is the composition 

l^mLx Ly —> k 

x 

Recall the definition of the finite dual L° of the algebra L; it consists of those linear 
functionals a : L — > k which happen to kill an ideal of L having finite codimension. 
We therefore define the rank of an element of L° as the minimum m such that a kills 
an ideal of codimension m. 

Definition 9.1.2. The rank of an element of a G A is the rank of 0(a) G (\^mL x )°- 

Proposition 9.1.3. Let (L,mult) be any algebra, (L°,A) its finite dual. Then if 
a G L° has rank m, A(a) can be written as a sum of no more than m simple tensors. 
Further, we can write 

m 

A(a) = J^A®7i 
where each fii and 7« themselves have rank no larger than m. 

Proof. Let I<\L be an ideal of codimension m such that cx(I) = 0, and let ei, . . . , e m G 
L be such that e± + /,..., e m + I is a basis for L/I, and extend the to a basis 
e±, . . . , e m , fi, f 2 , ■ ■ ■ for all of L. Recall the definition of A in terms of mult; it sends 
the functional a to the map 

and then passes to the isomorphism (L (g) L)° ~ L° ® L°. We have a basis for L® L, 
namely those tensors of the form ej ® /j, fi <8> e^, ® /j, ® e^. Since / is an ideal, 
the only of these basis elements that might not get sent to / under mult are those of 
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the form e { (g> ej, i,j < m. Then since a kills /, A (a) kills all but the <S> ej. Letting 
7i be the functional that sends to 1 and everything else to zero, we can write 

A(a) = ® ^ = Yl 7i ( Yl Ci ^i j 

for some scalars c^-, which is a sum of no more than m simple tensors. As each 7^ 
still kills J, rank(7j) < m for each i, and the last claim is proved as well. □ 

We shall need the following lemma from linear algebra. 

Lemma 9.1.4. Let V,W be vector spaces over some field, and let Y^i=i v i ® w i e 
V <S> W . Then this expression is of minimal tensor length if and only if the vectors V; L 
are linearly independent, and the Wi are linearly independent. 

Proof. Suppose that one of the collections or Wi are not linearly independent, let's 
say the V; L . Then say v n is in the span of v±, . . . , i> n -i, and write 

Vn = aivi + . . . + a n - X v n -\ 

Then 

n n—1 
^ Vi <S> Wi = ^2 Vi® Wi + v n ® w n 

1=1 1=1 

n—1 n—1 

= ^2 Vi ® Wi + aiVj ^ ® Wn 
i=i i=i 

n-1 

= ^2vi<g) (lVi + aiW n ) 
i=l 

which is a sum of less than n simple tensors. Therefore the expression is not of 
minimal tensor length. 

Conversely, suppose that Y^i=i Vi ® Wi can ^ e reduced in tensor length, and that 
both the collections V; L and Wi are linearly independent; we shall force a contradiction. 
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Suppose we have a tensor length reduction given by the equation 

n n 

^2v i ®w i = ^2v' i ®w' i (9.1.2) 



»=i i=i 



where we let v' n = w' n = 0. The Vi,v' { and w^w^ span finite dimensional subspaces of 
V and W respectively; fix bases {ex, . . . , e m }, {/i, ...,/;} for these subspaces. Then 
write 



m I 

Vi = ^2 Ci i e i Wi = Y dik ^ k 

3=1 k=l 
m I 

3=1 k=l 



If we plug these expressions into equation 9.1.2 above and rearrange the summations 
a bit, we obtain 

Yl [ Yl Ci i dik \ e 3®fk= Y I 2 c 'ij d 'ik J e i ® h 

j=l...m \ i=l / j=l...m \j=l / 

fc=l...i fc=l...J 

By matching coefficients on the linearly independent simple tensors ej®fk, we obtain, 
for every j and k, Y^i=\ c ijdik = Y17=i c 'ijd'ik- But the left hand side is the (j, k) th entry 
of the matrix (cjj) T '(dy) , and the right hand side the (j, k) th entry of (c£-) T (d^-). Thus, 
equation 9.1.2| is equivalent to the matrix equation 



(%fK) = (^) T «,) 

Now since the V{ sit in an m-dimensional space and the wi sit in an Z-dimensional 
space, and since we are assuming both the v i: Wi to be linearly independent, we 
conclude that n is no bigger than either m or I. Further, the linear independence 
of the Vi is equivalent to the linear independence of the row vectors of the matrix 
(cjj), i.e. the column vectors of (cjj) T . Similarly the row vectors of (dij) are linearly 
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independent. This means that the matrix (cjj) T has fullest possible rank, namely 
n; for the same reason (dy) has rank n. Viewing the product (cjj) T (aV,) as a linear 
transformation k l — >■ k n — >■ £; m , we see that this product also has rank n. 

But we claim that (c^) T (^-) has rank less than n. The condition that v ' n = w' n = 
forces the matrix (c^) T to have a column of zeroes at the far right, and (d'^) to have 
a row of zeroes at the bottom. Then (c^) has less than n non-zero column vectors, 
and so has rank less than n; similarly (dy) has rank less than n. Then clearly also 
must their product. 

We conclude then that (cij) T (dij) = (c^) T (<i^) have different rank, a contradiction. 
This completes the proof. □ 

Corollary 9.1.5. Let ip : V — )■ W be an injective mapping of vector spaces, and let 
v G V (g) V. Then if (ip ®ip){y) G IV ® can &e written as a sum of no more than 
m simple tensors, so can v. 



Proof. Write v = X^ILi v i® v 'i as a minimal sum of simple tensors, so by lemma 9.1.4 ;he 
Vi and the v[ are linearly independent. As ip is injective, the collections ip{v,j) and ip{ v i) 
are also linearly independent. Then the expression (if) <g) ip){y) = Y^i=i ^{ v i) ® i^Wdi 



again by lemma 9.1.4, is of minimal tensor length in W Cg> W. □ 



We can now prove the key fact which allows us to define a natural coalgebra 
structure on A R . 

Proposition 9.1.6. If a G A — liinL^ has rank no greater than m, then A(a) can 
be written as 

m 

A(a) = 2_. h ® c i 

i=l 

where each hi and C{ themselves have rank no greater than m. 

Proof. Let A be the coalgebra structure on liniL^, A' that on (h_mLx)°, and let 



(a) = a. As a has rank no greater than m, proposition 9.1.3 tells us that A' (a) can 
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be written as a sum of no more than m simple tensors. As <fi : HmL^ — > (hmLx)° is 
injective, and since is a map of coalgebras, we have 

(0®0)(A(a)) = A' (a) 

which, by the previous corollary, shows that A (a) is a sum of no more than m simple 
tensors. Then write 

n 

A(a) = bi <S> Ci 
i=i 

where n is minimal, and in particular so that the bi and the q are linearly independent, 
and so that n < m. We claim that all of the bi and c$ have rank no greater than m. 

Suppose not, and say b± has rank greater than m. Let / < l^mLx be an ideal of 
codimension m killed by a. Then <fi{b\) cannot kill all of /; lets say it doesn't kill 

fel. 

Now since the q are linearly independent, so are the <p(ci). Then we can find lin- 
early independent vectors Vi,...,v n G limLx such that <fi(c,^(vj) = dij (lemma 1.5.8 
of [1]). Then (under the isomorphism (limLx)° ® (lhri-^x) — (hmL^ <8> limLx)°), 
Ylii <M^) ® 4>{ci) does not kill the element / <S> V\. 

But A' (a) does kill / <g> v\\ its action is given by the composition 

lim <g> hm Lx — -» hm 

and as / is an ideal, f ® V\ gets mapped into I under mult, and as a kills /, A' (a) 
kills / ® v\. But this is absurd, since A' (a) = £V 0(a») <8> 

Thus 0(6i) cannot have rank greater than m, and the same argument obviously 
applies to all of the 6j and q. This completes the proof. □ 

Now let (Ai,Ai,6i) be an indexed collection of Hopf algebras over the fields ki, 
U an ultrafilter on /, [j u i4j the ultraproduct (as a vector space) of the A iy and 
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let (pi : Ai — > (limLx)° be the map defined by diagram 9.1.1 for each Aj. Define 
the subset A R C Ylu Ai ^° cons i s t exactly of those elements [etj] G n«^« suc b that 
rank(aj) (defined by each 0j) is bounded (equivalently, constant); we call this subset 
the restricted ultraproduct of the Hopf algebras Aj. We show now that Ar can 
be given the structure of a coalgebra. 
Consider 



HA, ® Ai 



4 



A R ® A K 



We want to show that there is a A making this diagram commute, which is simply the 
assertion that the image of [AJ, when restricted to Ar, is contained inside Ar ® Ar. 
For [ai] G A^, say with rank m, write 



where we can take m to be constant over z by proposition 9.1.6 this element is in the 
image of $. Pass it down through to 



En 



Again by proposition 9.1.6 we can take all of the bij and Cj,- to have rank < m, showing 



that this expression is in fact contained in Ar ® A.r. Thus A does indeed exist. 
We define a co-unit map e from Ar to m the obvious manner, as the com- 
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position 



We forego the proof that (Ar, A, e) satisfy the relevant diagrams making it a coal- 



gebra. There are two diagrams to check, namely diagrams 2.1.1 and 2.1.2 but these 



follow from the almost everywhere commutativity of these diagrams with respect to 



each (Aj, Aj,£j), by application of the 'if direction of proposition 6.2.5 (which holds 
even in the non-boundedly finite dimensional case), and the naturality of $. 

We have not yet proved that Ar is a Hopf algebra, that it is closed under multipli- 
cation and can be given an antipode map. This will follow later as we show that Ar 
is coalgebra isomorphic to a Hopf algebra, namely A^, the representing Hopf algebra 
of Y[ R Covno& At . 

9.2 The Map from to A R 

Here we define the map of coalgebras from our representing Hopf algebra for 
fI fl Comod^, called A^, to the coalgebra A R defined in the previous section, which 
we will show in the next section is an isomorphism. 

Let us begin by fixing some notation. I is an indexing set, hi is a collection of fields 
indexed by /, k is the ultraproduct of those fields, (Ai, \,£i) is a collection of Hopf 
algebras over those fields, Ci is the category Comod^^ is the fibre (i.e. forgetful) 
functor on each d, C is the restricted ultraproduct of the categories Ci, and u is the 



fibre functor (as defined in theorem 7.2.4) on C. For an object X of Ci, Lx is the 



image of End(uJi\(X)) inside End(wj(X)), and similarly for an object [Xi\ of C, Lyxi 
is the image of End(a;|([Xj])) inside End(a;([Xj])). 
Each Ai can be recovered as 

Ai = lim L° x . 
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and we write the corresponding direct limit diagram as 





(9.2.i; 



Note that we have used the same symbol <fi for the several such existing in each 

category; no confusion should result. 

We also have, in each category, the inverse limit diagram 





1 1 



and again we have used T to stand for the transition maps in all of the categories. 
We also have the unique map 0« making 




commute, as defined by diagram 9.1.1 As is by definition the representing Hopf 
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algebra of C, it can be recovered as a direct limit according to the diagram 

c 





(9.2.2) 



where the direct system is, as usual, the objects of the category C, with [Xj] < [Yj] 
meaning that [Xj] e (\Xi\), and the 4'[Y i },[z l } being dual to the transition mappings 

Proposition 9.2.1. Let [X;],^] be objects of C with %] e ([X]). 

L[Xi] = Tiv/Lxi under the isomorphism End k (u([Xi])) ~ Y[ u End ki (u i (X i )) 

3. The transition mapping XJyjyXj] : ^[Xi] L^] can be identified with the ultra- 
product of the transition mappings, [Tx if yJ : Ylu^Xi ~^ Ylu^Yi 

4- The natural L^x^-module structure on u;([Xj]) can be identified with the ultra- 
product of the module structures on Ui(Xi) 

5. The natural L° x ycomodule structure on u;([Xj]) can be identified with the ultra- 
product of the L° x .-comodule structures on Ui(Xi) 

Proof. Let [Xj] have dimension n. To prove the first claim, we work through the 



characterization of £[x,] given by theorem 3.4.3 We start by fixing an isomorphism 



a : k n Lo([Xi])*. As u;([X;])* = (Uu u i( x i)Y can be identified with [7 W ^PQ*> a 
can be uniquely written as [atj : fc" — > Wj(Xj)*]. For our ip we may as well choose 
the identity map [Xj] n — > [Xi] n , since any subobject factors through it; then ip is the 
ultraproduct of the identity maps [ip l = lj] : [X 4 n ] — > [X™]. It is also clear that the 
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map if) a : a;([Xj] n ) — > w([Xj])* <8> w([Xj]) can be identified with the ultraproduct of 
the maps i^f" : Wj(Xj) n — > Wj(Xj)* g) Wj(Xj), if we allow a factorization through the 
isomorphism fL^iM* (gu^pQ) ~ (EL^PQ))* ® fL^M- 

Next we are asked to find -Pp^p the smallest subobject of [Xj] n such that -0^, ((^(.P^^ ) ) 
contains id : w([Xj]) — > w([Xj]). PfxA * s an object of n^i; an d can De written 
as P^ Xi ] = Wi] f° r some collection Yi of objects of Cj. We claim = P^. for al- 
most every i. We can identify the element id G u;([Xj])* £g> u;([Xj]) with the element 
[idj] G riw to '*(^)* ® an d the concepts of "smallest" and "subobject of" are 

both first-order. Thus the following statements are equivalent: 

1. [Yi\ is the smallest subobject of [Xj\ n such that ^ a {uj{\Yi\)) contains id : w([Xj]) — > 
"([*<]) 

2. For almost every z, Yi is the smallest subobject of Xf such that ^.(^(l^)) 
contains idj : Wi(Xj) —> Ui(Xi) 



Then we must have Pr^-.i = [P^], whence 



u u 

and claim 1. is proved. 

Claim 2. is immediate, as the taking of duals is known to distribute over ultra- 



products for boundedly finite dimensional collections of algebras (proposition 6.3.2). 



For claim 3. we note that since [Xj] G ([^]), Xi G (Yi) for almost every i (lemma 



9.2.2), and so Tx it Yi is defined for almost every i. To prove the claim we look to the 
definition of the transition mapping. Let \Yj\ G ([XJ), and suppose for example that 
[Yi\ is a subobject of [Xj], under the map [tj. By 1. above, every member of LpQ] is 
of the form [fa], where 0j G Ljsq for almost every i. Then the image of [fa] under the 
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transition mapping Tjy^jXi] is the unique map [<7j] that makes 



it, 



u u 



commute, which is equivalent to the almost everywhere commutativity of 



which is equivalent to <Ji = Ty^Xii^i) f° r almost every i. Thus Tp^.pCi] ([<&]) 
[TYi,Xi(4>i)], and claim 3. is proved. 

Claim 4. is merely the statement that, for [0j] E LpQ] and [xj\ E u;([Xj]), 
which is true by definition. Claim 5. is similarly proved. 



□ 



Part 2. of the above proposition tells us that, instead of the direct limit diagram 



9.2.2, we can write instead 



C u 





with the understanding that <\>\z^ is factoring through the isomorphism L° z , ~ Y\ U L° Z .. 



Lemma 9.2.2. If [XJ, %} are objects of C, and if [X t ] E (\Y$, then X, t E (Yj) for 
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almost every i. 

Proof. All of the concepts "is a subobject of", "is a quotient of", and (for fixed n) 
"is isomorphic to an n-fold direct sum of" are first-order statements in the language 
of abelian tensor categories. To say that [Xj\ G (\Yi\) means that, for some fixed n, 



[Xj\ is a subobject of a quotient of \Yj\ n . Apply theorem CO. 15 to see that the same 
must be true for almost every i. □ 



Now, let us take diagram 9.2.1 and apply ultraproducts: 





As it stands this diagram is a bit nonsensical: P7 ;v lini c L° x . is little more than a 
set, being the ultraproduct of a collection of Hopf algebras, and lacking any kind of 
coalgebra structure. We claim however that each of the maps [0yj have their image 
inside A R C ri M ^« = T"Lli m c L° Xo the restricted ultraproduct of the Hopf algebras 

A. 

Proposition 9.2.3. The image of each for [Yi] G C, is contained inside Ar. 

Proof. Consider, for fixed i, the diagram 
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We claim that if Y; L has dimension n, then for any a G L Y . , 0y (a) has rank no larger 
than n 2 . Commutativity of the above gives 



<M0y t (a)) = T Y .(a) 
which is equal to the composition 




Ci 



Now Ly, being a subalgebra of Endv CCfe . (^>i(Yi)), certainly has dimension no larger 
than n 2 ; further, the kernel of Ty oa contains the kernel of Ty. But Ty is an algebra 
map, and so its kernel is an ideal of limLjsQ, and has codimension no larger than n 2 . 
Thus 4>i{4>Yi («)) has rank no larger than n 2 , hence by definition 0y(a) has rank no 
larger than n 2 . 

Then if [Yj] G C, say of constant dimension n, and if [ctj] G n w ^X;> then [0y]([aj]) == 
[0y(ai)] has bounded rank, each being no larger than n 2 ; thus it is contained in 
A R . □ 

We now have two diagrams 




where, in the second diagram we have replaced n«!ilSr L° x with A R , as we may by 
the previous proposition; some routine arguing shows that since 0y : L Y . — > Ai is a 
coalgebra map for every i, then [0y] : riw^y ~~ -^fl i s & l so a coalgebra map. Some 
care must be taken here; on the right, the map [0y,zj is defined whenever Yi G (Zj) 
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for almost every i, while on the left, it is only defined when [Yj] e ([Zj]). Nonetheless, 



by lemma 9.2.2 , whenever it is defined on the left, it is defined on the right. We 
can now appeal to the universal property of direct limits to invoke the existence of a 
unique coalgebra map Q making the following diagram commute: 




This Q : — > Ar is our claimed isomorphism of coalgebras, later to be shown, of 
Hopf algebras. 



9.3 Q is an Isomorphism 

This Q, while difficult to define, is not that difficult to describe. A typical element 
of = hmriw-^x, looks like 

[[Oj : L Xi -> ki]u)c 

That is, it is an equivalence class of equivalence classes of linear functionals, with U 
denoting the equivalence defined by the ultraproduct with respect to the ultrafilter 
U, and C denoting the equivalence defined by the direct limit over C. Each is an 
arbitrary linear functional, subject only to the restriction that the objects have 
bounded dimension. 
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A typical element of A R C f| w limL^. on the other hand looks like 



[[«» : L X t ->■ fci]Ci]« 

where C« denotes the equivalence defined by the direct limit over each d. The Xj 
here are not assumed to have bounded dimension; only that the functionals ctj have 
bounded rank. 

The action of is simple then: 

n : [\a t : L Xi -> h} u } c i— )• [[a, : Lx, ->• /c,] c J w (9.3.1) 

To see it this way it is not at all obvious that it is well-defined, or that it is a map of 
coalgebras, but we know it is, via the way we constructed it. 

Lemma 9.3.1. Let C be the category ComodA, and let [a : Ly — > k] be an element of 
lini c L° x which has rank no greater than m. Then [a : Ly — > k] can be written as 

\d:L c ^ k] 

where C is some subcoalgebra of A having dimension no greater than m. 

Proof. Y is in the principal subcategory generated by some subcoalgebra C of A, and 
so we have the map L Y ^ Y ' c > L° c \ this shows that we may as well take [a : Ly — > k] 
to be [7 : L c — > k] for some 7 G L° c . To say that this element has rank no greater 
than m is to say that the composition 




c 

kills an ideal I < limLx of codimension no greater than m. Let J be the kernel of 
T c . We can assume that I contains J; if not, enlarge I to / + J, which is still an 
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ideal contained in ker(Tc< o 7) having codimension no larger than m. Then we have a 
commutative diagram 



c 



T C 



■>c 




{\mvL x )/I 

c 

where 7r is the natural projection, and 7r' is the unique surjective map gotten because 



J C I. As Q^mL x )/I is a quotient algebra of L c , theorem 8.1.2 guarantees that it is 
isomorphic to for some subcoalgebra D of C, and that under this identification we 
can take tc' to be the transition map T^c- Thus we have the commutative diagram 




with Ld of dimension no greater than m. And since ker(7) D Tq(I) = ker(To ) c) 
there exists a linear functional /3 making 



Td,c 




commute. By definition then, 7 and (3 are equal in the direct limit. We can thus 
write 

[7 : L c -)• k] = [f3 : L D :-)• fe] 
Finally, since has dimension no greater than m and D is a coalgebra, theorem 
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8.0.6 tells us that D ~ L° D has dimension no greater than m. This completes the 



proof. □ 
Proposition 9.3.2. The map Q is surjective. 
Proof. Again, a typical element of Ar looks like 



[[on : L Xl -> fei]c! 



with the ojj having constant rank, say m. Then lemma 9.3.1 shows we can write this 
instead as 

[[A : -> Nd« 

where each Di is a subcoalgebra of Ai having dimension no larger than m. Then the 



formula given for VL at the beginning of this section (equation 9.3.1 ) shows that 



[[Pi ■ L Di ki] u ]c 

qualifies as a pre- image for our typical element under Q. □ 

Lemma 9.3.3. Let G be an affine group scheme represented by the Hopf algebra A 
over a field k. Let (V, p) be an n-dimensional A-comodule, fix a basis e\, . . . ,e n for V , 
and let (ajj) be the matrix formula of the representation of G it defines in that basis. 
Then C = span k (aij : 1 < i, j < n) is a no more than n 2 - dimensional subcoalgebra of 
A. Further, (V,p) can be embedded, as a comodule, into C n . 



Proof. Apply the comodule identity A(ay) = ^2 k CLik <8> dkj (equation 10.0.2) to see 
that A(C) C C <S> C, whence C is a subcoalgebra of A. For the embedding claim, 
we examine the embedding V — > A n (n-fold direct sum of the regular representation) 
defined in section 3.5 of [TU]; we claim that the image of this embedding is in fact 
contained in C n C A n . Let ^ : V <g> A — > A n denote the vector space isomorphism 
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ej <g> a i-> (0, . . . , a, . . . , 0) (a in the i slot, zeroes elsewhere). Consider 



V V ® A > A n 



A" 



(A <g> A) n 



v v v 
V <g> A > V <g> A <g> A > A n ® A 



Commutativity of the left rectangle is a comodule identity (see diagram 2.2 A), and 



commutativity of the right rectangle is obvious, whence this entire diagram commutes. 
Note that the composition that starts at the top right and goes directly down is by 



definition the comodule structure on A n (see definition 2.3.3 ). Looking at a condensed 
version of the outermost rectangle 



V P — >A' 



A" 



{a ® Ay 



V g> A - ~ > A n ®A 



we see that p o \1> is an embedding of V into A™. And if we chase the basis element 
ej from V to A n we arrive at 
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which is an element of C n . 



Proposition 9.3.4. Q is injective. 



□ 



Proof. Let [[a, : L Xi — > ki]u]c an d [[A : L Yi — > k^u\c De two typical elements of A c 
such that Q maps them to the same thing. This means that 



[[on : L Xl -)• h] Cl ]u = [[A : -)• h] Ci 



u 



which is to say that, for almost every i, 



[ai : L Xi -> fci]cj = [A : ^ -> &i]c; 



which is to say that, for almost every z, there is a such that Xi, G (Zj) and 




8 5 l 



commutes. Now the Zj are, as far as we know, not of bounded dimension, so we have 
some work to do. By lemma |9.3.3[ for each % let Cj be a subcoalgebra of A4 such that 
Ci has dimension no larger than dim(Xj © Y) 2 , and such that Xi © Yi is embeddable 



in a 



dim(Xi®Yi) 



Note in particular that this implies that both [Xi] and [Yj] belong 



to the principal subcategory generated by [CJ (since they are both subobjects of a 
subobject of [Cf 1 ] = [Ci] m for a fixed m). 

For each i let .Dj be a subcoalgebra generating all of the Xi, 3^, Zj and containing 
Cj, which of course we cannot assume is of bounded dimension. Then for every i we 
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have a commutative diagram 




and in particular, the outermost diamond commutes: 




and hence so does 




As Cj C Di are subcoalgebras, proposition 8.1.1 tells us that Tq d is surjective. Then 



commutativity of the above gives T Ci) D l ° T Xu d ° a i — Tc u Di ° Ty it d ° A> an d since 
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Tc i} Di is surjective, this gives us commutativity of 




Now apply ultraproducts to yield a commutative diagram 




Note that [CJ, being of bounded dimension, is an object of C. Then if we identify 



Y\ u L Ci with L[ Ci \, [T Xl ,c t ] with T[ Xi ],[Ci], etc. (as we may by proposition 9.2.1), com- 



mutativity of the above implies the equality of [ai\ and \J3i\ in the direct limit over C; 
that is 

[[on : L Xl -»• h)u]c = [\Pi ■ L Yl -»■ fo]w]e 



as desired. 



□ 



Theorem 9.3.5. The representing Hopf algebra of the restricted ultraproduct Y[ R Comod^ 
is coalgebra-isomorphic to the restricted ultraproduct Ar of the Hopf algebras Ai. 



Proof. Apply propositions 9.3.2 and 9.3.4 



□ 
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9.4 The Equivalence J \ U C< c± Comod^ 

For a collection of Hopf algebras Ai, the previous section shows that Ar is coal- 
gebra isomorphic to A^, the representing Hopf algebra of J| Comod^. Then as A^ 
is a Hopf algebra, so is Ar, under whatever multiplication and antipode map are 
induced on it by fl We would like to of course prove that this induced multiplication 
and antipode are exactly those inherited by being a subset of i.e. that they are 

the ultraproduct of the individual multiplications and antipodes on the restricted 
to A R . 

We will prove this for multiplication; we do not prove it for antipode, but believe a 
similar proof to the one we give for multiplication (using instead the dual construction 
instead of the tensor product) could be constructed. 

Our first step is to build the equivalence from the category Yl R Ci to Comod^ 
induced by the isomorphism f2; here our work will finally start to pay off, as this 
equivalence is quite natural and easy to describe. Examination of this equivalence 
will further yield the required multiplication on Ar, as we examine the tensor product 
on Comod^ induced by this equivalence. 



First, following the construction mentioned in theorem 3.3.4, we build the equiva- 
lence G : rj H Comod J 4 i — > Comod^. To keep notation simple we use the same symbol 
X for an object of Comod^ and its image under the fibre functor, and similarly for a 
morphism. 



Let [Xi, pi : Xj — >■ Xj® Ai] be an object of n,R^- The remarks before theorem 3.3.4 



tell us we should define the Aoo-comodule structure on [j^Xj to be the composition 



YiXi ^ x\Xi ® L\ Xi] m4>[Xi] ) iUXi) <g> A c 



9.2.1 



says 



where p' is the natural L? x ,-comodule structure on n^^- -^ u ^ proposition 
that we can replace L? x , with n«-^Xi' anc ^ i n so d°i n g can define p' in terms of the 
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individual L^.-comodule structures on each X iy whom we call p\\ that is 



® yilv (rw ® a 





commutes. The Aoo-comodule structure on n w ^> can ^ i s thus the composition 



ux t ^ ® ^ ® rra, > (ipq) ® ^ 



G([Xi]) is thus ri M ^5 w ^ n the above comodule structure. For a morphism 
: Xi ->■ Fi] in ]^[^Ci, we define of course G([ifji\) to be [ipi] : flw^ ~^ Tlu Y i- 
The next step is to pass to the isomorphism Q to obtain an equivalence of cate- 
gories Yl R Ci ->■ Comod^. 

Theorem 9.4.1. Define a functor F : ~~ Comod,A R as follows. F sends the 

object [Xi, pi : Xj — > Xj(g)Aj] to the vector space Y\ u Xi with the A R -comodule structure 



Y\x t -^K Y\Xi ® A ^> Y\Xi ® nA ^ (EM ® A fl 

u u u u u 



and sends the morphism [ip : — > Yj\ to [ipi] : riw^ ~~ y Yiu^- Then F is the 
equivalence of categories induced on G by Q. 



Proof. Consider the diagram 



Ux t -M i\x t ® l° ^> ]\x t ® (iiXi) ® a c 



i<s>r2 




nx ® ^ — > ® 




(rpQ)<E>A R 
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The composition that starts at the top left, goes all the away across, and then down, 
is the functor gotten from G by Q; that is, the top line is the Aoo-comodule structure 
on [Xj] under G, and then we tack on 1 <g> Q to obtain an /L^-comodule structure. 
The composition that starts at the top left, goes diagonally down, and then all the 
way across, is the composition referenced in the statement of the theorem. We want 
to see then that this diagram commutes. Commutativity of the left-most triangle is 
equivalent to the almost everywhere commutativity of it, which in turn is simply the 
statement that the Aj-comodule structure on Xi can be factored through the L° x - 
comodule structure for it, and through the canonical injection §x v The next square 



follows automatically from the naturality of $ (proposition 6.2.14). Commutativity 
of the next triangle is obvious, as is known to point to A R , and the last triangle 
follows from the definition of Q. The theorem is proved. □ 

Thus we have an equivalence of categories F : Y\ R Ci — > ComodA fl given by the 
previous theorem. As Yl R Ci is a tensor category under £g>, it induces a similar structure 
on Comod^ through F, which we call Cg>, whose action is given as follows. Any two 
objects of Comod^ pull back under F to objects of Y\ R Ci which look like 

[Xi,pi : Xi -> Xi® Ai\ 

[Yi,m : Yi ->■ Yi® Ai] 
Their tensor product in Y\ R Ci is defined as 



and we push this new object back through F to yield a new object in ComodA fl , having 
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underlying vector space n«^j ® ^ an d comodule map given by the composition 



Y\x t ® y ) ® Ai ® y <g> A [10Tl01] ) n^< ® ^ <e> Ai <g> a 

WW W 



Playing the same game we see that two morphisms in Comod^ pull back to mor- 
phisms [tpi : — > V$], : Yi — > Wj\ in fJ^C^, and upon taking their tensor product 
in Y\ R Ci and pushing them back through F we obtain the image under <g> of these 
morphisms: 

iixt ® Yi ) m ® Wt 

u u 

Now let us modify <g> a bit; simply tack on $ to both ends of the above to yield 



fjX, ® Fj > ® ^ ® V; ® A, [1 ® Tm ) ftXi ® Y; ® A ® A 

u u u 

J^l^L^ Y\x t ® Yi ® a, ^4 (i\x t ® y) ® nA d opq®^)®^ 



^ ® m ® ^ 



u u 



and instead of [ipi ® , write 



The naturality of the isomorphism $ guarantees that this new functor is naturally 
isomorphic to ®; let us relabel this new functor as ®. 

The next proposition simplifies the description of ®, one which doesn't require 
first pulling an object back to HU^i- 

Proposition 9.4.2. If(X,p), (Y, /i) are objects of ComodA R , then ® sends this pair 
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to the vector space X <g> Y , with comodule map given by the composition 



X <g> Y ^ X <g> A R <g> Y <g> A R C X <g> n A ® V <8) II A 

x®y®I]^^ IIA 101 "> x y IIA d * <g> y <g> A fl 



where mult denotes the natural coordinate wise multiplication on Yl u A- 

Proof. As F : Yl R Ci ~~ ComocU^ is an equivalence, there is an object [(Xi, pi 
W R Ci such that X = Yl u Xi and p is equal to the composition 



Y\x t ® a ^ im ® n a im ® ^ 



www 



and similarly for (Y, //). Consider 
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U U 



Y\x t <s> Ai<s> Yl Y i ® A 



A 



[pi 1 



nx ® ® n** ® riA <-> iw ® a ® ^ ® a 

W W W W W 

A A 



[l®Ti5 



x <g> A fi <g> y <g> A 



nx ® a r ® ® a r u x i ®Yi®Ai®Ai 

[l(g>l(g>multi] 

/ 

x ® nA ® ^ ® nA = im ® nA ® ® n^v£ n-** ® ^ ® ^ 

l®T<g>l 



w u u u 

A 



l®T(g>l 




x ® y ® Y[Ai ® = iw ® u Y i ® ® riA iw ® ^ ® 



W W W U 



l®l®mult 



x ® y <g) nA 




l(g>l®mult 



x ® y <g> a 



ft 



nx ® ® n ^ 



www 
A 



(n^i ® Yi) ® a r 



rpQ®ro®^ 



which the author would at this time like to nominate to the Academy as the ugliest 
diagram of all time. The top left rectangle commutes by the previous remarks, and 
all of the other simple sub-polygons, though numerous, are easy to check; thus this 
entire diagram commutes. The composition that starts at the top right and goes all 
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the way down is the image of (X, p) and (Y, p) under <g> as described previously; the 
composition that starts at the top left and goes all the way down is the composition 
given in the statement of the proposition. These are equal, and the proposition is 
proved. □ 

So then, we have a bifunctor £g> on Comod^ which sends two comodules to a 
new comodule whose underlying vector space is the tensor product of the underlying 



vector spaces of the comodules. Proposition 3.3.5 tells us that this functor is induced 
by a unique /c-homomorphism u : Ar <g> Ar — y Ar] that is, Cg> sends the objects (X, p) 
and (Y, p) to X ® Y, with Afj-coniodule structure given by the composition 



X g> Y X ® A R ® Y ® A R > X ®Y ® A R ® A R 
y X ®Y g) A R 



We claim that this u is nothing more than the natural multiplication on n^^i re_ 
stricted to Ar. 

Lemma 9.4.3. Let (C, A) be a coalgebra. Define the subset S of C to consist of 
those elements a G C with the following property: there exists a finite dimensional 
comodule {X,p) over C such that, for some element x G X, p(x) = ® a i> with 

the Xi linearly independent and a = ai for some i. Then S spans C . 

Proof. Let c G C, and C'cC the finite dimensional subcoalgebra it generates. Write 
A(c) = Y2i a i ® fr« with the a, linearly independent. Then obviously all of the 6j are 
in S if we view C as a finite dimensional comodule over C. Apply the coalgebra 
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identity 



C®C 




=01 



k®C 



to yield c = J2i £ ( a i)bi, showing c to be in the span of the bi 



□ 



Theorem 9.4.4. u is equal to the natural multiplication on Y\ U A restricted to Ar. 
Proof. Let (X,p),(Y,p) be finite dimensional comodules over Ar, and consider 



X® Y 



pOSfl. 



x ® a r ®y ® a r ^ x ® n a ® y ® HA 



1®T®1 



x ® y ® n a ® n a 



x ® A fi ® y ® a 



1®T®1 



x ® y g> Ar <g> A R 



l®l<gimult 



x ® y g) nA 



x ® y <g> a 



The composition that starts at the top left, goes across, and then all the way down is 



the image of the pair (X, p), (Y, //) under g) as proved in proposition 9.4.2 The one 



that starts at the top left, goes down, and then across is the bifunctor induced by u\ 
this diagram commutes by assumption. 

In the notation of the previous lemma, let a, b G S C Ar. This means that 
there is an (X, p) such that, for some x G X, p(x) = J2i x i ® a ? with the 2j linearly 
independent and a = a, for some i. Similarly, there is a (Y, p) and y G Y such that 
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^(y) = J2j Vj ® with the yj linearly independent and b = bj for some j. Now if we 
chase the element x <8> y in the above diagram around both ways, it gives 

eg y_j eg) ti(a, Cg 6j) = ® j/j Cg mult(aj <S> bj) 

Since the x,- t ® y 3 - are linearly independent, by matching coefficients we must have 
u(dj Cg 6j) = mult(aj Cg bj) for every i and j; in particular, u(a <g) b) — mult(a Cg 6). 

Thus we have shown that u and mult are equal on elements of the form a Cg b, 
with a,b E S. But the previous lemma shows that S Cg 5 spans Cg Ar. As both u 
and mult are /c-linear, they must be equal everywhere. This completes the proof. □ 



9.5 Examples 

9.5.1 Finite Groups 

Let G be a finite group defined over Z, A its representing Hopf algebra, ki a 
sequence of fields, A4 = A <g> ki = the representing Hopf algebra of G over ki, and 
Ci = Comody^. We claim that the representing Hopf algebra of Y\_ R Ci is nothing more 
than A Cg [] M ^. The following observation shows this not to be surprising. 

Proposition 9.5.1. The category Y\ R Ci is equal to the principal subcategory generated 
by [A®ki\. 

Proof. By [A ®ki] we mean the object of Yl R Ci consisting of the regular representation 
of G over k in every slot; it is clearly of constant finite dimension. Let [Xi] be an 



object of rifl^) sa Y °f dimension n. Then by theorem 2.2.2 each X{ is a subobject of 
a quotient of (A eg ki) n ; this is a first-order statement, and so [Xj\ is a subobject of 
a quotient of [A eg ki] n , showing [Xj\ to be in the principal subcategory generated by 
[A® hi]. □ 
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Proposition 2.20 of [5] tells us that the property of being singularly generated in 
fact characterizes those neutral tannakian categories whose representing Hopf algebra 
is finite dimensional. In fact we can identify this Hopf algebra as End (a; | ([A £§> ki\))°, 



since [A ® ki] generates the entire category. Then by proposition 9.2.1 and theorem 
8.0.6| we have that Aoo, as a coalgebra, can be identified as 



A 00 =End(u\([A(S)k l ]))° ~ U^iuMA^h)) = ]\A®h 

u u 

Thus Aoo is equal to the full ultraproduct of the A® ki. Note that in this case Ar, 

the restricted ultraproduct of the A £g> ki, is in fact equal to the full ultraproduct, 

since the A £g> ki are of constant finite dimension. 

Since the A®ki are of constant finite dimension, we have an isomorphism n«^® 
— Ylu^ ® rL^; ^ * s n °t hard to see that the latter can be further identified as 

A <S> Ylu as an algebra, coalgebra, and indeed Hopf algebra. 

Thus, for G finite, the category n fl Rep fc .(jr can be identified with Rep nwfc .G. 

9.5.2 The Multiplicative Group 

Here we compute Aoo for the multiplicative group G m using the work done in this 
chapter. Let Cj = Rep fc .G m with ki being some sequence of fields. Let Ai denote the 
representing Hopf algebra of G m over the field ki, which we identify as 

Ai = ki [x, x ] 
Aj : x (-> x <S> x 
mult : x r ® X s h-> x r+s 

We know then that A^ is isomorphic to A R , the restricted ultraproduct of the Hopf 
algebras Ai, which we set about now identifying. 
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Fix a field k, and let A = k[x, x' 1 }, with A, mult denned as above. Then A can be 
realized as the increasing union of the finite dimensional subcoalgebras B C B 1 C . . ., 
defined as 

B^i — — sp£iii^ {x ; X ^ ^ , . . . , X ; X , X j X j ... j X ) 

The dual algebra B* to B n we identify as 

B* = span fc (a_ n , . . . , a , . . . , a n ) 
mult„ : a r <8> a s 5 rs a r 

As the B n form a direct system under the inclusion mappings, the B* n form an inverse 
system under the duals to these inclusion mappings. This map B* n «— B* +1 is given 
by, for a r : B n+X — > k, the image of a r is a r if |r| < n and otherwise. Then we leave 
it to the reader to verify 

Proposition 9.5.2. The inverse limit l^im B* of the B* can be identified as span k (cti : 
i G Z), with mult defined by 

mult : a r <8> a s >->■ 5 rs a r 
and i/ie canonical mapping T m : hrn 5* — >■ -Bj^ gwen fry 



a r i — ^ 



a r i/ |r| < m 
otherwise 



Next we must identify the map : A — > (HmB*)° giving us the notion of 'rank' in 
A. For x r G A, we pull it back to x r G -B m for some m > \r\, pass to the isomorphism 
£? m ~ B™, and then up through T^. Thus we obtain 

Proposition 9.5.3. The linear functional 4>(x r ) acts on hjn-B* by 
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(f)(x r ) : a s ^ 5 rs 

Let / = c\x mi + C2X 1712 + . . . + c n x mn be an arbitrary element of A, with rrii G Z 
and Q G k. Then </>(/) kills the ideal 

I = span fc (a r : r G Z — {m 1; m 2 , . . . , < lim B* 

This ideal is the largest ideal that <f){f) kills and it has codimension n. Therefore 

Proposition 9.5.4. The rank of the element f = C\X mi + c 2 x m2 + . . . + c n x m ™ G A 
is equal to n, the number of distinct monomials occurring as terms. 

Let Ai = ki[x, x~ l ] be the representing Hopf algebra of G m over the field k{. 
According to the previous proposition, the rank of a polynomial in Ai is equal the 
number of monomial terms occurring in it. Thus, in order for an element [/;] G EL^* 
to have bounded rank, it is necessary and sufficient for it to have almost everywhere 
bounded monomial length. If this bound is n, then in almost every slot f\ has length 



among the finite set {0, 1, . . . ,n}, and so by lemma B.0.10 we may as well assume 
that the /j have constant length. Then we have 

Proposition 9.5.5. The restricted ultraproduct Ar of the Ai can be identified as 

Ar = {[fi] G Y\Ai : fi has constant length } 
u 

A : [/,] ^ [AiC/O] 
mult : [fi] <g> [gi] h-» [multi(fi ® 9i)} 

Let us find a tighter description of Ar. Let Ylu^ denote the ultrapower of the 
integers and let A' denote the k = Y[ u ki-span of the formal symbols x' 2 ^, [z;] G Ylu^- 
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Define the following Hopf algebra structure on this vector space as follows: 

A' = span fe (x [Zl] : [ Zi ] G 

u 

A : x [z ' ] i-> x [Zl] <g> x [Zl] 
mult : x [z > ] <g> x [m] i—)- 
e : rr [z!] M- 1 
5 : x [2l] i-» x [ - 2l] 

Now every element of Ar looks like 

[a\x z * + ... + afx^} 

with a\ G fcj for every % and j, and G Z with 2;/ < zf < . . . < z™. Then define a 
map Ar to A' by 

[a\x$ + ... + afx z ?\ ^ [a]\x^ + ... + [a™]x [z ^ ] 
We leave it to the reader to verify 

Proposition 9.5.6. The map just defined is an isomorphism of Hopf algebras. 

Finally, let us build the equivalence of categories Yl R ^i ~^ Comod^ using the 
description of Ar given above. Let [Xj,pj] be an object of \\ R Ci of dimension m. 
It is well known that any module for G rn over a field is simply a diagonal sum of 
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characters; that is, in some basis, it has matrix formula 



\ 



x 



\ 



X 



J 



for some collection of non-negative integers zi, . . . ,z m . For each i then, fix a basis 
e}, . . . , e™ of Xj for which the action of G m is a diagonal sum of characters. Then we 
can write 



Pi : e\ i-> <8> x 2 



The vectors [ej], . . . , [e™] form a basis for n w -^i> anc ^ m this basis the A/j-comodule 
structure of riw^ ls given by 

P '■ 14] *-> K] ® x [z * ] 
That is, the action of Goo = the group represented by Ar on flw^j is simply 



\ 



X 1 



X 1, 
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Chapter 10 



A Combinatorial Approach to the 
Representation Theory of 
Unipotent Algebraic Groups 



As promised in the introduction, for the next several chapters we take a break 
entirely from working with ultraproducts, and instead focus on working out the con- 
crete representation theories of certain unipotent algebraic groups. Here we outline 
the approach we will be taking for all of the proofs constructed throughout. 

Let G be an algebraic group over the field k with Hopf algebra A — k[xi, . . . , x n ]/I, 
where I is the ideal of k[x\, . . . ,x n ] generated by the defining polynomial equations 
of G (e.g., if G = SL 2 , then A = k[x±, x 2 , x 3 , x^]/ (x±X4 — x 2 x 3 — 1)). If (V, p) is a 
comodule over A with basis e 1? . . . , e m , we can write 



where c[- is a scalar for every i,j and f, and the summation runs over some finite 




E 4 



Xn 

n 



r=(n,...,r„) 
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collection of n-tuples of non-negative integers. If for each r we think of ( 
an m x m matrix over k, its significance is that it consists of the coefficients of 
the monomial x^ 1 . . . x 7 ^ 1 in the matrix formula for the representation in the basis 
ei, . . . ,e m . For example, consider the group G a x G a , where G a denotes the additive 



group (see chapter 12). G a xG a has as its representing Hopf algebra k[x, y\. Consider 
the representation defined by 



1 1 2y + x 2y 2 + 2yx + \x 2 ^ 








1 





2y + x 
1 



Then in this basis the (cjj) matrices are given by 



M (0 ' 0) 



^ 1 ^ 



1 
1 



(Qi) (1 ' 0) 



1 
1 




\ 



{Cij) 



(0,1) 



^ 2 ^ 



2 




(Cij) 



(2,0) 



/ 



o o 






1 \ 



-"13) 



,(0,2) 



^ 2 ^ 








,(1,1) 



V 



2 





with (d 



for all other r. Now the two diagrams asserting that a given vector 
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space V and linear map p : V — > V ® A is & comodule over A are 




V® A 



V 



V®A 



V <g> k 



1®A 



F ® A > V ® A® A 



In the first diagram, if we chase ej along both paths we arrive at 

Cj (g) 1 = ® e(a^) 

i 

and we see by matching up coefficients that £((%) = 1 if i = j, zero otherwise. This 
simply says that 

(e( aij )) = Id (10.0.1) 

which will be some matrix expression among the (cy) matrices. For the second 
diagram we chase ej along both paths and arrive at 

and again, by matching coefficients, this reduces to 



^2a ik <g> a fci = A(a^) 



(10.0.2) 



Notice that the left hand side is precisely the entry of the matrix product (a^® 

1)(1 <g) ay). In practice these two equations will allow us to derive matrix equalities 
between the various (cjj) r , which will serve as necessary and sufficient conditions for 
them to define representations over a given group. 

This approach is particularly amenable to the study of unipotent groups. It is well 
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known that all unipotent algebraic groups (and quite obviously the ones we will be 
studying) have Hopf algebras which are algebra-isomorphic to A = k[xi,x 2 , ■ ■ ■ ,x n }; 
that is, isomorphic to a polynomial algebra with no relations. As such, the collection 
of all monomial tensors *™ form a basis for A <g> A, and a 

great deal of our work will involve looking at equalities between large summations 
in A (g) A and trying to match coefficients on a basis. Thus, for unipotent groups, a 
logical choice of basis with which to attempt this is always available. 

We would like to briefly mention that, in a round-about way, what all this amounts 
to is working with the Lie algebra of the group in the characteristic case, and the 
distribution algebra in the characteristic p > case (see chapter 7 of [13] for a good 
account of the latter). In fact, for a given G-module, these (cij) r matrices correspond 
to the images of certain distributions under the associated Dist(G) module, and these 
matrix equalities we shall be deriving essentially amount to working out the multi- 



plication law in Dist(G); compare for instance equation 12.1.2 with equation (1) of 



page 101 of [13]. We have chosen however to proceed without this machinery; it gives 
us no advantage to our purposes, and in any case puts less of a burden on the reader 
(and the author for that matter). 



10.1 Morphisms 

None of this work would be worth much if we couldn't say something about 
morphisms between comodules. 

Again let G be an algebraic group over the field k, with Hopf algebra A = 
]/I, and this time fix a monomial basis {x r ^ . . . x r ^ ■ f G R} of A. For 
brevity, for r G R, denote by x r the monomial a^ 1 . . . x r ^ . If (V, p) is a comodule over 



180 



A with basis ei,...,e n , we can write 



i=l 



°*j — ^ ] c'i 



feR 



with each c^- a scalar. Similarly let (W, //) be a comodule over A with basis fi, ■ ■ ■ , f m , 
and write 



i=l 



T r 



Theorem 10.1.1. Let <fi : V — >■ W be a linear map, and write it as the matrix (kij) 
in the given bases. Then <p is a morphism ofV and W as A-comodules if and only if 
for every f e R 

(hj)(cij) r — (dij) r (kij) 

Proof. Consider the diagram 



V 



-> w 



V® A 



W® A 



whose commutativity is equivalent to being a morphism between V and W . If we 
chase ej along both paths we arrive at 



n / m 



i=l \ s =l 



i=l \ s =l 



Replacing the a's and 6's with their definitions in terms of the c's and d's and re- 
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arranging a bit, we have 



E E J /. ® x? = E E ) /• 

S,f \ 1=1 / S,f \ 1 = 1 



As / s ® :r r for varying s = 1 . . . m and r G -R is a free basis for W <g> A, we must have, 
for every j, s and r E R 

n m 

E ^ c ij = E c ^* i ^- 



i=l i=l 



But the left hand side is the (s, j) th entry of (kij)(cij) r , and the right the (s, j) th entry 
of (dijfihj). 

□ 

A combinatorial lemma we will need later: 

Lemma 10.1.2. Let (V, p) ,(V, (/,) be two representation of G on the finite dimensional 
vector space V , given by 



p : e i *-> E ei ® = E c ^ r 

i r 

l' ■< y :> E ' ' l>: ' l) '> = E (l 



Then the matrices (a^ <g> 1) and (1 ® fry) (taking their entries from A® A) commute 
if and only if for every r, s e i? ; £/ie matrices (cij) r and (dij) s commute. 

Proof. The (i, j) th entry of (a - <g) 1)(1 <g) is 



E a ^ ® ^ = E ( E c ik x? ) ) ® ( E 4^ ) = E ( E ) ^ ® ^ 

fc fc\r / \ s / r,s \ 
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while the (i,j) th entry of (1 ® <S> 1) is 



fc A; \uecr / \ s / f,s \ k 



By matching coefficients on the free basis x r <S> x s for A <8> A we must have, for every 
i,j < n and r,sER 



k 



i.e. 

(cy) ? (dy) ? =(d«) ? W 



□ 
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Chapter 11 

Representation Theory of Direct 
Products 

Here we are interested in the following: given that one has a handle on the rep- 
resentation theory of the algebraic groups G and H, what can be said about the 
representation theory of G x HI Any representation of G x H is evidently a rep- 
resentation of both G and H in a natural way. It is also evident that one cannot 
paste together any two representations of G and H to get one for G x H; they must 
somehow be compatible. The goal of this section is to prove necessary and sufficient 
conditions for representations of G and H to together define one for G x H, and to 
provide a formula for it. 

I will spoil the suspense: two representations $ and \P for G and H on the vector 
space V define one for G x H on V if and only if they commute; that is, the linear 
maps $((?) and ^(h) commute for every pair g G G, h G H. The matrix formula 
for the G x if-module they define is nothing more than the product of the matrix 
formulas of the constituent modules. Morphisms for the new module are exactly those 
that are morphisms for both of the constituent modules, and likewise direct sums and 
tensor products behave as we hope they will. 



184 



We leave it to the reader to verify the following: if (A, A a ,ea) and (B, A B ,e B ) 
are the representing Hopf algebras of G and H, then the Hopf algebra foiGxH is 
(A <g> B,A,e), defined by 

A: A®B Aa0Ab ) A <S> A <S> B <S> B ^ Twist ^ A® B ® A® B 
e :A®B £A ® £B ) k®k~ k 

The natural embedding G — > G x H is induced by the Hopf algebra map A ® 

A ® /c ~ A, similarly for iJ. Then for any vector space V and A ® 5-comodule 

structure p: V — )• V" Cg> A eg) .B, we get an A-comodule structure on V given by the 

composition 

V -^V ®A®B mm£B ) V®A®k~V®A 

Similarly we get a 5-comodule structure by instead tacking on I® e A ®l. 

Let (V,p), (V,fj,) be two comodule structures on the vector space V, the first for 
G, the second for H. Write 

i 
i 

Theorem 11.0.3. Let (V,p), (V, fj) be modules for G and H as above. Define 

Zij = ^ a ik ® b k j 
k 

Then the map a:V^rV®A®B defined by 

a : e,j i— > ® z^ 

i 

is a valid module structure for G x H if and only if the matrices (a^) and (b^) commute 
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with one another, a restricts naturally to p and p via the canonical embeddings , and 
this is the only possible comodule structure on V that does so. 

Some explanation is in order. The matrices (a^) and (fry) take their entries from 
different algebras, so it doesn't make much sense to say they commute. What we 
really mean is that the matrix products (a y - ® 1)(1 <S> b^) and (f ® bij)(a,ij <S> 1 ), which 
take their entries from A <g> 5, are equal. The content of the theorem then is that, 
if they do commute, the matrix = (a^ ® 1)(1 <g) fry) provides a valid module 
structure for G x H, and this is the only one that restricts to p and /j. In what 
follows we shall simply write and even when we wish to consider them as 
elements of A <S> B. 

Proof. Given that this actually is a representation, the representation induced on G 
from it is given by 

ej H> ^ e 'i ® Z ij 1C ^f £S ^ e i ® ^2 a ' ik ® SB ^kj) 
i i k 

= ^2 ei ® ^2 aik ® = ^2 ei ® a ^ 

i k i 

and similarly for H. Thus a does indeed restrict to p and p. 

To prove that this actually is a representation, as always, we must check that the 
equations e(zij) = 5ij and Ylk z ik ® Zkj = are satisfied. For the first, we have 

e(zij) = e(y] a ik <g> 6 fcj -) 
fc 

= y^£A(a»fc)£i?(fefcj) 
k 

= ^2 ^ikfikj 
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as required. For the second equation, we have 



A(zij) = A(^2a ik <g> b kj ) 
k 

= A A (a ifc ) <g> A B (b kj ) 
k 

A; Z ra 
Twist ^ — \ 

— aa ® bkm ® aik ® bm i 

k,l,m 

where we have used the fact that p and p are comodule structures for G and H, and 
applied the twist at the end. Note that the last expression is equal to the (i,j) th 
entry of the matrix (a>ij)(bij)(a,ij)(bij). For the other side: 

^2 z ik ® Zkj = a ' u ® hlk> ® E ° fcm ® 

fe k l m 

= y^ y au® bik <S> a krn <g> b mj 

k,l,m 

This is equal to the (i,j) th entry of the matrix (Q>ij)(o>ij)(bij)(bij). Thus, all this 
reduces to the matrix equality 

(an ) (bij ) (a,, ) (bij ) = (da ) (a,, ) ((>,, ) ((>,, ) 

And since the matrices (a^) and (6^) are necessarily invertible, we can multiply both 
sides on the left by (a^) -1 and on the right by (fe^) -1 , and we are left with 



(bij)(aij) = (a i:j )(b 



XI > 



This proves the first part of the theorem. Lastly, we wish to see that a is the only 
possible comodule structure on V restricting to p and p. Let t : V — >V®A®Bhe 



187 



any such comodule, and write 



i 

t restricting to p and /i means that 1 <8> Sb '■ w%j >-» ciij and that e A <8> 1 : Wij h-> 
By virtue of r being a comodule map we have 

and, using the Hopf algebra identity (e<g)l)oA = (l<g)£)oA = l, 

(A A <g> A B )(^a ifc <g> b kj ) = (A A ® A B )(J^(1 <g> £ B )Kfe) <8) (e A <8> 

= w ik <g> 

But A^ <g> A B = A is a 1-1 map (as all co-multiplication maps are), and sends 
and a ik <S> to the same thing. We conclude then that = J2 k a ik <S> b k j. This 
completes the proof. 

□ 

11.1 Constructions and Morphisms 

Here we record some facts about certain constructions and morphisms on repre- 
sentations oiGxH, relative to the induced representations on G and H. Throughout, 
let A and B be the representing Hopf algebras of G and H, V and W fixed finite 
dimensional vector spaces spanned by {ej} and {fj} respectively, and endowed with 
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A eg) _B-comodule structures a and r, given by 



a :ej i — y 




Further, let a^, a"s, t^, r B be the induced co module structures on A and -B, given by 



Then we know from the previous section that the matrix (zy) is equal to the matrix 
product (aij)(bij), and similarly = (lyjfmy), that the matrix (oy) commutes 

with and that (Zy) commutes with (my). Since a and r are the unique repre- 
sentations restricting to the given ones, we shall say that they are induced by o~a,ob 
and ta, t~b respectively. 

Proposition 11.1.1. The direct sum <t©t is induced by the direct sums <7a©Ta and 

Proof. If we view (zij) and (wij) as the matrix formulas for the representations a and 
r, then the matrix formula for a © r in the basis {e^} U is 







(%) 



\ 



V 




which we can write as 



/ 



(a«)(6«) 



\ 




\ 



V 



(lij)(mij) J 
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This last formula is exactly that of the representation on G x H induced by that of 
o~A © Ta and a B © r B . □ 

Proposition 11.1.2. The tensor product of representations a<g>T is induced by o- a ®ta 
and a B (g) r B . 

Proof. This is merely the observation that tensor product of matrices commutes with 
matrix multiplication. The matrix formula for the representation a Cg> r is 

( Zij ) <g> ( Wij ) = (aijWij) <g> (h^irriij) = [( aij ) ® (^)][(^) <8> (m^-)] 



which is the representation induced by ® t\ and <tb <8> Tb. 



□ 



Proposition 11.1.3. A linear map <fi : V ^ W is a morphism between the repre- 
sentations (V, a) and (W, r) in the category Rep k G x H if and only if it is both a 
morphism between the representations {V, a a) and (W, ta) in the category Rep k G and 
between (V, a B ) and (W,t b ) in the category Rep k H . 

Proof. Write as the matrix (kij) in the relevant bases. Recall that, being a mor- 
phism in Rep fc G x H is equivalent to the matrix equality = {wij){kij). Then 
the 'if direction of the theorem is obvious, since by assumption (z^) = = 
(bij)(aij), similarly for (wy). 

Conversely, consider the diagram 



V 



V®A®B 



W 



V ®A®B 



V® A 



</«S) i 
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V 

> V <g> A 



Commutativity of the top rectangle is the statement that is a morphism between 
(V, a) and (W, r), and commutativity of the bottom rectangle is true no matter what 
is. This gives us commutativity of the outermost rectangle, which is the assertion 
that is a morphism between (V, a a) and (W, ta) in Rep fc G. An identical result holds 
in Rep k H if we consider ea instead. 

□ 
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Chapter 12 

The Additive Group 



In this chapter we give a complete characterization of the finite dimensional repre- 
sentations of the additive group G a over any field, using no more than the combinato- 



rial methods described in chapter 10 The goal, as with both of the unipotent groups 
we'll be investigating, is to show that, for characteristic p > large with respect to 
dimension, modules for G a in characteristic p look exactly like modules for G™ (direct 
product of copies of G a ) in characteristic zero. This is by far the easiest case we'll 
consider, as even when p « dim, the analogy is still very strong (which is atypical; 
modules for the Heisenberg group, discussed later, in dim >> p do not share this 
property) . 

We start by considering the group the countable direct product of G a ; all we 
need for the group G a will emerge as a special case. For a fixed field k we identify 
the Hopf algebra A for the group G^ as k[x\, x 2 , ■ ■ .], the free algebra on countably 
many commuting variables, with A and e defined by 

A = k[x!,x 2 , ■ ■ •] 
A : Xi i-> 1 <g> Xi + Xi <g> 1 
£ '. x% i — y 
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Let (V, p) be a comodule over A, fix a basis {e^} of V, and write 

a '.y 

i 

Ojj — ^ ] CjjX 
r=(n,...,r n ) 

where we adopt the notation Notice that only finitely many of 

the variables Xi can show up in any of the a^-, so for all intents and purposes, this is 
really just a comodule over a finitely generated slice of A, namely k[x\, . . . ,x n ], the 
representing Hopf algebra of the group G™. 



12.1 Combinatorics 



We look first at equation 10.0.1 



(e(ay)) = Id 



Note that e acts on by simply picking off its constant term c°-. The above thus 



reduces to the matrix equality 



Id 



This equality is intuitively obvious, for when we evaluate the matrix formula of the 
representation at X\ = . . . = x n = we are left with {cij)^ 0, "' ,Q \ which should indeed 
be the identity matrix. 



Next we look equation 10.0.2, namely ^2 k a>ik <8> a,kj — ^( a ij)- Working with the 
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right hand side first we have, making repeated use of the binomial theorem 

r 

= ^ C tj{ 1 ®Xl+Xl® l) n • • • (1 ® Xn + Xn <E> lf n 



r 




Here we have adopted the notation, for two mtuples of non-negative integers rh = 
(mi, . . .,m n ) and f = (n, . . . ,r n ), 

m + f— (mi +n, . . . ,m n + r n ) 

and 

/ m^ 

For the left hand side we have 

^ a ifc <g> a fci 

k 



~ ( ( ( "M 

\r\ J \ r 2 J " \r n J 

f,s \ k / 
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Thus, equation |10.0.2| reduces to 




r,s \ k / r k+T=r 



X^ ® X' 



1 



Now the polynomial ring k[xi, . . . , x n ] has no relations, whence the collection of all 
monomial tensors x r ®x s for varying r and s constitutes a free basis for k[x±, . . . ,x n ](& 
k[xi, . . . , x n }; we can therefore simply match coefficients. In the left hand side of the 
above equation, clearly each monomial tensor occurs exactly once, and its coefficient 
is ^2 k c r ik c s k y In the right hand side it is also true that each monomial tensor occurs 
exactly once, for if you choose k and /, there is only one r in whose summation the 
term x k £g> x l will occur, and there it occurs exactly once. The coefficient of the 
monomial tensor x' r <g> x s on the right hand side is thus ( r ~t*) c^ + ^ . Then we have 



For every i, j, r*and s. But the right hand side is simply the (i, j) th entry of the matrix 



for every f and s. This equation, along with = Id, and the requirement that 

(cij) r should vanish for all but finitely many r, are necessary and sufficient for a 
collection of matrices {cij) r to define a representation of or over any field k. 
In the case of G a = G\ the above equations reduces to 




k 



(cij) r (cij) s , and the left hand side is the (i,jY h entry of the matrix ( r t g j(cy) r 



Equation 10.0.2 is therefore equivalent to the matrix equality 




(12.1.1) 



(en) = Id 
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and 

(12.1.2) 

Again, these equations, along with the requirement that (cij) r vanish for large r, are 
necessary and sufficient to define a representation of G a . 

For the rest of this chapter we restrict to the case of G a , and treat the case of zero 
and positive characteristic separately. 



r + s 
r 



12.2 Characteristic Zero 

Let k have characteristic zero. 

Theorem 12.2.1. Every n-dimensional representation of G a over k is of the form 
e xN , where N is an n x n nilpotent matrix with entries in k. Further, any n x n 
nilpotent matrix over k gives a representation according to this formula. 

Proof. By e xN we mean the sum 

X 2 N 2 x m N m 

1 + xN + — — + ... + — 

2 ml 

which of course terminates since N is nilpotent. Obviously such a formula gives a 



representation, in view of the matrix identity e xN e yN = e ( - x+y ^ Ar (see lemma 13.5.2). 



For the converse let p : ej J2i e « ® a ij be any representation, and set iV 



C,; 



vl 



Then examination of equation |12.1.2 gives, for any r > 

N r 



Since (cjj) r must vanish for large r, N r = for large r, whence N is nilpotent. 
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Recalling that the matrix formula for this representation is 

(Cij) + ( Cij ) l x + . . . + (dj) n x n 

where n is the largest non-zero (c^), this representation is indeed of the form e xN . □ 

In the preceding proof we used the fact that char(/c) = in the form of assuming 
that ^ is defined for all non-negative integers r. 
Example: take the 4x4 nilpotent matrix N = 

^0100^ 

10 

1 
v i 

Then the representation it defines is e xN = 

2 6 
1 X % 

1 X 

\ 1 / 

Proposition 12.2.2. Two representation e xN ,e xM are isomorphic if and only if the 
nilpotent matrices N and M are conjugate. 

Proof. Sufficiency is immediate. For the necessity, suppose e xN and e xM are isomor- 
phic via a base change matrix P. Then 

Pe xN p -i = l + xPNP ~i + ... = 1 + X M + ... 
which forces PNP' 1 = M. □ 
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Corollary 12.2.3. Let k have characteristic zero. 

1. For an n- dimensional representation of G a over k, the polynomials occurring as 
matrix entries cannot have degree larger than n — 1 

2. For a given dimension n, there are only finitely many non-isomorphic n-dimensional 
representations of G a over k 

Proof. If N is nilpotent, it is nilpotent of order no greater than the dimension of the 
representation; this proves 1. Any nilpotent matrix is conjugate to a Jordan matrix 
with O's on the main diagonal, of which there are only finitely many for a given 
dimension; this proves 2. □ 

12.3 Characteristic p 

Let k have characteristic p > 0. In this section we prove 
Theorem 12.3.1. Any representation of G a over k is of the form 

e xNo e xPN le xP 2 N 2 e xP m N m 



with each of the factors commuting (and so necessarily and sufficiently all of the Ni 
commuting) , and each Ni being nilpotent of order < p. Further, any finite collection 
of commuting, p-nilpotent matrices defines a representation according to the above 
formula. 



Note that, for a sequence of assignments to the (cij) n as in equation 12.1.2, satis- 
fying the relations (cij)° = Id along with 
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are still necessary and sufficient to determine a representation, no matter the charac- 
teristic. The gist of the theorem is that, while in characteristic the entire represen- 
tation is determined by the nilpotent matrix (c^-) 1 , in characteristic p the binomial 
coefficient ( r + s ) can and often does vanish, which relaxes the relations that the (c^) 
matrices must satisfy. This is to be expected, since for example 

' 1 x* m ^ 

v° 1 J 

defines a perfectly respectable representation of G a in characteristic p, for any m as 
large as we like. This illustrates that, for example, (cjj) pm is not determined by the 
matrix (c^) 1 . 

It turns out that one can choose the matrices (cij) pm freely, subject only to the 
condition that they commute and are nilpotent of order < p, and that these completely 
determine the rest of the representation. These (cij) pm matrices for m > 1 should 
thus be thought of as accounting for the "Frobenius" parts of the representation. 

To start, we need some number theory concerning the behavior of binomial and 
multinomial coefficients modulo a prime. The notation ( a b n J denotes the usual 
multinomial expression , & " ! , . 

Theorem 12.3.2. (Lucas' theorem) Let n and a,b,...,z be non-negative integers 
with a + b + ... + z = n, pa prime. Write n = n m p m + n m _\p m ~ x + . . . + no in p-ary 
notation, similarly for a,b, . . . ,z. Then, modulo p, 

n \ j if for some i, Oj + h + . . . + z% > p 
a ' 6 '---'^ ! ( "° )( ; ni )...( " m ) otherwise 

I \ao,bo,...,zo/ Voi,bi,...,zi/ Va m ,o m ,...,2 m / 

Some corollaries we will need later: 
Corollary 12.3.3. Let p be a prime, n,r and s non-negative integers. 
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1. The binomial coefficient (™) is non-zero if and only if every p-digit ofn is greater 
than or equal to the corresponding digit of r 

2. If n is not a power of p, then for some < r < n, (") is non-zero 

3. If n is a power of p, for every < r < n, (™) is zero 

4- ( r ^ S ) i s non-zero if and only if there is no p-digit rollover (i.e. carrying) for the 
sum r + s. 

See [6] for a proof of these facts. 

Theorem 12.3.4. A representation of G a over k given by the matrices (cjj) n is 
completely determined by the assignments 

( Cl3 y°=x , ( Cij y 1 = x 1 , ...,( Cij y m = x m 

(with the understanding that {cij) pk = for k > m). The X { must necessarily com- 
mute with each other and satisfy Xf = 0. 



Proof. All is proved by examining equation 12.1.2 It is easy to see by induction that 



the values of {cij) n are determined by the X{. If n is a power of p then its value is 



given, and if not, by 2. of corollary 12.3.3 there is < r < n with (™) ^ forcing 



( Ci4 ) \ I ( Cij J ( Cij j 



For the commutativity condition, if n ^ m, then by theorem 12.3.2 ( p ^ ) is 
non-zero, and we must have 



^ V 1 Y Y — I V V \ Y Y 



p ,n J \ p 
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To prove the nilpotency claim, consider 



( \v m = x 



{Cij) 



2p" 



2p m \ 1 



Xi 



p-i 

n 

,fc=i 



kp r ' 



xp- 1 

m 



Noting that there is carrying in computing the sum (p m+1 — p m ) +p m , corollary 12.3.3 



tells us that ( p 1 ) = 0, and we have 



V 



m+l 



p r, 



X„ 



+ 1 = [Ci 



)(P-1)P™ ( Cij )P m 



p-1 

n 

,fc=2 



n -1 



forcing X^ = 0. 



□ 



We have shown thus far that commutativity and p-nilpotency of the Xi are 
necessary to define a representation; we must now show sufficiency This will be- 
come clear once we have a closed expression for (cij) n in terms of the X{. For 
n = n m p m +n m _ip m ~ 1 + . . .+n inp-ary notation, let T(n) = n !ni! . . . n m \. Obviously 
r(n) is always non-zero mod p. 

Proposition 12.3.5. Let Xq, . . . ,X m be pair-wise commuting p-nilpotent matrices, 
and let n = n m p m + . . . + n$ be the p-ary expansion of n. Then the assignment 



{c lJ Y = T{n)- 1 X^X^...Xl 



defines a representation of G a over k. 



Proof. Obviously these assignments satisfy (c^-) = Id, with (cy) n vanishing for large 
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n (we define = for i > m). Then it remains to check the equation 



r + s 



ipij) — i c ij) { c ij , 



Let r = r m p m + . . . + r , s = s m p m + . . . + sq, and suppose first that ( 1 ^ s ) = 0. This 



means, by corollary 12.3.3, that there is some digit rollover in the computation of 



r + s, i.e. r, + Si > p for some i. Looking at the right hand side in view of the given 
assignments we see that X? * will occur as a factor. But is nilpotent of order 
less than or equal to p, so the right hand side will be zero as well. 

On the other hand, if ( r ^ s ) ^ let r + s = z m p m + . . . + z , so that necessarily 
r i + Si = Zi for all i. Then the given assignments give the same power of each on 
either side, so it only remains to check the coefficients. This reduces to 



r + s 



r 



T(r)T(s) =r(r + s) 



After applying theorem 12.3.2 for the term ( r ^ s ), the equality is clear. 



□ 



We can now prove theorem 12.3.1 Let X , . . . , X m be commuting, p-nilpotent 



matrices over k. Then according to the previous theorem, the representation they 
define is 



D m + 1 -1 



n m + l_ 



fo)V= £ r(rr 1 Xr°X?...X, 

r=0 r=0 

p-1 p-1 p-l ^ 

= Y Y ■ ■ ■ Y ~\ ■ ■ ■ ~\ X 

r o=0 r 1=0 r m =0 

E (£ ^ 

r =0 u / \n=o 



r m T n-)+rip+...+rmP n 
m 



ri x rip 



P-1 j 

r. 



m 



\r m =0 
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as claimed. All of the factors of course commute, since the X { do. 
Example: define the following matrices: 



f 1 ^ 



V 



o o 1 





1 2 ^ 



V 








/ 



These all commute and are nilpotent of order less than or equal to 3. Then the 
representation they define in characteristic 3 is e xX o e x 3 Xi e x 9 x 2 _ 



/ 



V 



1 x + x 3 2x 2 + x A + 2x 6 + 2x 9 N 



x + x A 



J 



With a view toward defining the height-restricted ultraproduct later, we make the 
following simple but important observation. 

Theorem 12.3.6. 

1. Let k have characteristic zero. Then the n- dimensional representations of 
over k are in 1 — 1 correspondence with the finite ordered sequences Ni of nx n 
commuting nilpotent matrices over k, according to the formula 



N„ 



2. Let k have positive characteristic p. Then if p » n, the n-dimensional rep- 
resentations of G a over k are in 1 — 1 correspondence with the finite ordered 
sequences Ni of n x n commuting nilpotent matrices over k, according to the 
formula 

e xN 0( ^PN 1 e xP m N m 
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Proof. By the work done in chapter 11 , all representations of over k are given by 
commuting finite products of individual representations of G a over k. It is easy to 
see that the representations e xN and e xM commute if and only if N and M do; this 



proves 1. 2. follows immediately from 12.3.1, with the additional realization that if 
p is greater than or equal to dimension, being nilpotent and p-nilpotent are identical 
concepts. 

□ 
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Chapter 13 

The Heisenberg Group 



In this chapter we investigate the group Hi of all 3 x 3 unipotent upper triangular 
matrices 

( 1 \ 

1 x z 



1 y 
1 



for arbitrary x, y and z. Our intent is to prove a theorem analogous to theorem 12.3.6 



for the group G a ; that is, if one is content to keep p large with respect to dimension 
(larger than twice the dimension in fact), modules for Hi in characteristic p 'look like' 
modules for H^° in characteristic zero. This result will be more precisely stated in 
a later chapter, as we consider the 'height-restricted ultraproduct' of the categories 
Rep^i^i for a collection of fields ki of increasing positive characteristic. The results of 
this chapter are perhaps more surprising than the previous in that, unlike modules for 
G a , modules for Hi in characteristic p « dim look hardly at all like representation 
for H^° in characteristic zero; it is only when p becomes large enough with respect to 
dimension that the resemblance is apparent. 

It was the author's original intent to prove these results for all of the generalized 
Heisenberg groups H n , but that is not attempted here. Nonetheless, we shall at least 
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work out the fundamental relations for the groups H n , as all we need for Hi shall 
arise as a special case. 



13.1 Combinatorics for H. 



n 



Let H n , the n th generalized Heisenberg group, be the group of all (ra + 2) x (ra + 2) 
matrices of the form 

( i ^ 

1 Qlj ~^ • • • yi % 

i o Vl 



1 Vn 



\ 



J 



That is, upper triangular matrices with free variables on the top row, right-most 
column, l's on the diagonal, and O's elsewhere. The Hopf algebra for H n is 



A — k[Xi, . . . , x n , yi, . . . , y n , z] 

n 

A : Xi I-)- 1 <g) Xi + Xi <g) 1, y» >-» 1 <S> y» + y» <8> 1, 2i->;z<g)l + l<g)2: + ^ Xj <g> 

£ : Xi,yi,z h-> 



i=i 



Let us adopt the notation, for an n-tuple of non-negative integers r, x r = a^ 1 . . . x^ n , 
and similarly for y r . Let (V, p) be a comodule for G with basis {e^}, and p given by 

a ij 



and write 



(r,s,t) 
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Lemma 13.1.1. With as above, A(ajj) is equal to 



E E 

a) \0<f<ri,S2 

(r 2 ,s 2 . 6 ) 



fi + r 2 - A /si + s 2 - A / a + 6 + |t| 
r 2 A si A a, 6, 



(r i +^2 - *i si +S2 - Ca+6+ 1 i| ) 



Remark: the summation condition < t < ri, s 2 is understood to mean all t such 
that every entry in t is no larger than either of the corresponding entries of in ri or 
s 2 . \t\ means t± + . . . + t n . 

Proof. We start by computing 

f,s,t 

We have 

A(i r )=A(x 1 ) ri ...A(x„) r » 

= (xi <g> 1 + 1 <g> xx) ri . . . (x n <g> 1 + 1 <g> a; n ) r " 




where ('"j^" - ) is shorthand for the product • • • C"™" 1 ™)- Similarly we have 

f+g=s 
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and 



A(V) = (z <g> 1 + 1 <g> z + ^Xi® Vif 

i 



a + 6 + c\ 6 ^ / |t| 



® y* 



a+6+|*1=t 

def 



^ / a + b + 1*1 
/ — ' a,b,t l , . . . ,t n 



x t z a <g> y l z h 



where |t| = t\ + . . . + t n . Thus A(oy) is equal to 

V : S : t Z + m — r 

f+g=s 
a+b+\t\=t 



We seek to write this as a sum over distinct monomial tensors, i.e. in the form 



r 2 ,s 2 ,b 



for some collection of scalars x, which is merely a question of how many times a given 
monomial tensor shows up as a term in our summation expression of A(ajj). That is, 
how many solutions are there to 



x l+t y f z a <g> x'V+V = x ?1 y Sl z a <g> ^ 2 / 2 z 6 



Clearly the values of /, m, a and 6 are determined. Further, once one chooses t, the 
values of both / and g follow; thus, we can parameterize by t. For the n-tuple t to 
induce a solution, it is necessary and sufficient that none of its entries be larger than 
the corresponding entries in f\ or s 2 ; we shall express this condition by < t < fi, s 2 . 
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Then 

X 



— * — * _, 

fi, si, a\ _ (^t) fl + m\ff + g\f a + b+\t\ 

r 2 ,S2,b) , ^ v V m /V <7 / \a,6,*i,...,* n 

0<t<n,S2 



and upon substituting 

m = r 2 f — Si s = Si + S2 — t r = r 1 + r 2 — t t = a + 5 + |tj 



we get 



E 

0<f<ri,s 2 



ri,si,a 

rt + f 2 - A /si + s 2 - A / a + 6 + \t\ \ (r 1+ p 2 -t,s 1 +s 2 -t,a+b+\t\) 



X 



^ I f 2 ;V «i J\a,b,t u ...,t n l Cij 



which proves the lemma. 

□ 

Theorem 13.1.2. A finite collection of (c^) matrices defines a module for H n if 
and only if (cjj)^ ' 0,0 ^ = Id, and for all ri, r 2 , a, si, s 2 , 6, £/ie following matrix equation 
holds: 



0<t<ri,S2 



(Ci^^icij)^ 2 '* 2 ^ (13.1.1) 

ri + f 2 - A /si + s 2 - A / a + 6 + |t| 
f 2 A si A a, 6, t r 



Proof. This follows by matching coefficients for the equation A(ay) = <8> 
The coefficient of the monomial tensor x Tl y Sl z a ®x r2 y S2 z b for A(ajj) is the right hand 
side of the above equation, as proved in the previous lemma, while for J2 k a ik ® 
it is the left hand side of the above equation, as is easy to verify. □ 
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13.2 Combinatorics for H\ 

The Hopf algebra (A, A, e) for the group Hi over the field k is 



A = k[x,y,z] 

A : x i-)- 1 eg) x + x eg) 1, y (->• 1 eg) y + y eg) 1, z^l®z + a;(8)?/ + z(g)l 

e : x, y, z i— )• 

Let be a finite dimensional vector space over k, p : V — )■ V ® A a, A;-linear map. 
Fix a basis {e^} of V, and write 

p '. Cj i — Y ej Cg) 

i 

Each a^j e A is a polynomial in the variables x, y and z, so write 

= E 4 xri y r2zr3 

r 

where the summation is over all 3-tuples of non-negative integers, remembering of 
course that = for all but finitely many r. 

Theorem 13.2.1. A finite collection of {cij) r matrices defines a module for Hi if 
and only if they satisfy (q 7 -) ( - ' ' - ) = Id, and for all 3-tuples f and s 



min(ri,S2) 



[C-ij) [C-ij t 



1=0 



r x + si - l\ fr 2 + s 2 - l\ /r 3 + s 3 + I 
si )\ r 2 J\ r 3 ,s 3 ,l 



[Cij 



(13.2. V 



Proof. Apply theorem 13.1.2 to the case of n = 1 



□ 



We will also make good use of the fact that G contains three copies of the additive 
jroup G a , one for every coordinate. This says that, for example, the collection of all 
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matrices of the form (cjj)^ r,0 '°^ (the matrices representing the x-coordinate in the 



representation) must in isolation satisfy equation 12.1.2 



( C ij) 



(r,0,0)/ .^(«,0,0) 



r + s 



r 



(r+s,0,0) 



An identical statement holds for matrices of the form (cij)^°' r '°^ and those of the form 
(c i jp '°' r \ the y and z parts respectively. These relations could of course just as well 



have been read off of equation 13.2.1 



From here on we treat the cases of zero and prime characteristic separately. 



13.3 Characteristic Zero 



Let k be a field of characteristic zero, and (V, p) a representation of Hi as in the 



previous section. Set X = (cy) (1 '°' 0) , Y = (c ij )<- > 1 >°\ Z = (c, 



\ (o,o,i) 



Theorem 13.3.1. A representation of Hi over k is completely determined by the 
assignments X and Y . Necessarily Z = [X, Y], each of X and Y must commute with 
Z , and X, Y and Z must all be nilpotent. Further, any X and Y satisfying these 
relations defines a representation of G over k. 

Proof. We know from our previous work with the additive group G a that the following 
identities must hold: 



( Cii ) (r '°' 0) = ^x r ( ClJ y°^ = ^Y r (c l3 y°^ = ^z r 



We work with the fundamental relation for Hi, equation |13. 2.1 

min(r'i,S2) , j\ / i\ / r\ 



„ \(r 1 +sl~l,r 2 +s 2 -l,l) 
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We have 



1=0 



which says that 



/ n — l\ fm — l\ fl 



I \ (n,m,G) _ ymjyn 

lj m\n\ 
Using again the fundamental relation, we also have 

M^Kcn)^ = ( Cii ) (n ' m,fc) 

which together with the last equation gives 



, )(n,m,fc) = _^vm X "Z fe 

n\m\k\ 



Thus, all of the (c^-) are determined by X, Y and Z, according to the above formula. 
Further, 

and so Z = [X, K] as claimed. Each of Y and X must commute with Z, for if we apply 
the fundamental relation to each of XZ and ZX, in each case we obtain (c^p 1 ' ' 1 ', 
showing XZ = ZX, and an identical computation shows YZ = ZY. And by our 
work on G a we know that each of X, Y and Z must be nilpotent. 

We must now show sufficiency of the given relations. Let X, Y and Z be any 
three nilpotent matrices satisfying Z = XY — YX, with each of X and Y commuting 



with Z. We need to show that the fundamental relation, equation 13.2.1 is always 
satisfied. We assign 

( \{n,m,k) = _J_ Z k Y m X n 

n\m\k\ 

Since each of X, Y and Z are nilpotent, (cy) r will vanish for all by finitely many 
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r, as required. The fundamental relation, with these assignments, reduces to (after 
shuffling all coefficients to the right-hand side and some cancellation) 



min(ri,S2) 



"""V'i>»^ / \ / \ 

grs+szY^ x ri Y S2 X Sl = \ ^ l\ ( 1 ) | I z rz+S3+l Y r2+S2 ~ l X ri+Sl ~ l 

'WW 

Each term in the summation has the term Z r:i+S:i in the front and X Sl in the rear, 
and so does the left-hand side. So it suffices to show 

min(ri, S2 ) / \ / \ 

Y r 2X Tl Y S2 = ^ ^ l\( \( jZ l Y r2+S2 ~ l X ri ~ l 

and since Y commutes with Z, the summation term (minus coefficients) can be written 
as Y T2 Z l Y S2 ~ l X Tl ~ l . We can now take off the Y T2 term from the front of either side, 
so it suffices to show 

min(ra,m) 



X n Y m = nr\r l \z i Y m ~ i x n ~ 



where we have renamed r\ and si with the less cumbersome n and m. 

We proceed by a double induction on n and m. The case of n or m being zero 
is trivial, and if n — m — 1, the above equation is XY = Z + YX, which is true 
by assumption. Consider then X n Y, and by induction suppose that the equation 
holds for X n ' x Y, so that X n ~ l Y = YX 71 ' 1 + (n - f )ZX n ~ 2 . Then using the relation 
XY = Z + YX and X commuting with Z, we have 

X n Y = X n ~ l XY 

= X n -\Z + YX) 

= ZX n - x + {X n - X Y)X 

= ZX n - 1 + (YX^ 1 + (n- l)ZX n - 2 )X 

= nZX n ~ x + YX n 
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and so the equation is true when m — 1. Now suppose that m < n, so that 
min(n, m) = m. Then we have 

= (FX n + nZX n ~ 1 )Y rn - 1 

= Y(X n Y m - 1 ) + nZ[X n - x Y m - 1 ) 

which by induction is equal to 



- (g«(;) ("7 + nZ (g«(» 7 (•» r 

= Y ra x n + ^ j, ^ ^ " ^ ^ym-l X n-l + ^>(Z - l) ! (j " ^ ^^"'^ 

where, in the last step, we have chopped off the first term of the first summation 
and shifted the index I of the second summation. If we chop off the last term of the 
second summation we obtain 

= ym X n + J- j, ^ (™ ~ ^ ^ym-l x n-l 



m - 1/ \m - 1 
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and upon merging the summations, we have 



Y m x n + 



'n\ fm — 1\ ... (n - 1 



,.!,..,-])!(" Mr" r U""v"- 

,m — 1 / \m — 1 , 



— +gKi)C 7 7 1 ) 



+ n!| n )( m \Z m X n ~ n > 



+ n{l - 1)! 



n — 1 
/ - 1 



m — 1 
/ - 1 



m — 1 
I - 1 



r^Z y m— 1 J^" n_ 



The two terms outlying the summation are exactly the first and last terms of what 
the fundamental relation predicts them to be. To finish then, it suffices to show that 
the term in brackets is equal to /!(") (™), which is a straightforward computation left 
to the reader. This completes the case of m < n, and the case n > m is hardly any 
different, and left to the reader. 

□ 



13.4 Characteristic p 

Here we are not interested in giving a complete combinatorial classification of 
characteristic p representations of Hi. Rather, we shall only be interested in the case 
where p is sufficiently large when compared to the dimension of the module. Doing 



so, we obtain a result analogous to theorem|12.3.6 for the group G a , namely that such 



representations 'look like' representations of H™ in characteristic zero. 

Let (V, p) be a comodule for G over the field k of characteristic p > 0, given by the 
matrices (c^Y over k. Again, matrices of the form (cjj)^' 0,0 ), (cjj)( ' r '°) and (cij)^°'°' r \ 
the x, y and z parts of the representation respectively, must in isolation define rep- 



resentations of G a over k. We know then that (proposition 12.3.5), for example, all 
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matrices of the form (Q J -) (r ' ' ' ) are completely determined by the assignments 

X = X x = (c ii ) (plA0) , • • • , X m = (^) (pm '°' 0) 

and abide by the formula, for r = r m p m + r m _ip m_1 + . . . ro in p-ary notation 

( Cjj )M,o) = r(r) -i X n» X[ i . . ^ 



and that the Xj must commute and be p-nilpotent. An identical statement holds for 
the matrices Y m = (c^ -^ ) and Z m = (c ij ) (0 * ' pm) . 

From here on we adopt the notation = (cy) , similarly for Y(j) and 
Note that Xn\ and Xj are not the same thing. 

Theorem 13.4.1. Let k have characteristic p > 0. A representation of H\ over k 
is completely determined by the Xi and Y{. The X{ must commute with one another, 
same for the Yj, and Z i} and each Xi must commute with every Zj, same for Y$ and 
Zj. 



Proof. We work again with the fundamental relation for Hi, equation 13.2.1 

min(ri ,S2) 



(Cij 



1=0 



taking care of course to realize when a given binomial coefficient is or is not zero mod 
p. We begin with 

Y (m) X {n) = ( Cii ) (0 ' m ' 0) (%) (nA0) 



' \ / \ /; 



E 

1=0 



0J \ / V/ 

(n,m,0) 
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and 



Y (m) x (n) z {k) = ( Clj )^\ Cij r>w 




(n—l,m—l,k+l) 



, \(n,m,k) 
■13) 



Thus we have a formula for an arbitrary (c^) matrix: 



(cij) ( "' m ' fc) = F( m )X( n )Z( fc) 



We now show that each of Zi are determined by the X; and Yj. The fundamental 
relation gives, just as in characteristic zero 



showing Zq = [X ,Y ]. Now assume by induction that Zi is determined by the X t 
and Yi for i < m, and we have 



For / < p m , Z(i} is determined by the Zj for i < m, who in turn, by induction, 
are determined by the Xi and Y^. Every term in the summation is thus determined 
by the Xi and Y iy hence so is Z m , the outlying term. This shows that the entire 
representation is determined by the Xi and Y^. 

To see that each Xi commutes with every Z,-, simply apply the fundamental 



XoY = (cijf^ia^ ^ = Y X + Z 



X Y - (r- )(P m > 'V(r- )(°'P m >V 
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relation to both XiZj and ZjX i} for which you get the same answer. Do the same for 
Yi and Zj, and this completes the proof. 

□ 

We ask the reader to note that it is not generally the case that Z rn = [X m , Y m ] 
for m > 0, nor is it the case that X^ and Yj commute for i ^ j (the author verified 
this with several counter-examples which he will not burden you with). However, we 
will see now that these relations do in fact hold so long as p is sufficiently large when 
compared to the dimension of a module. 

Lemma 13.4.2. Suppose that p is greater than twice the dimension of a module, and 
that the sum r + s carries. Then at least one of P( r ) or Qf a \ must be zero, where P 
and Q can be any of X , Y or Z. 

Proof. The key fact is that since the Xi, Yi, and Zi are all nilpotent, they are nilpotent 
of order less than or equal to the the dimension of the module, which we assume is no 
greater than p/2. Since the sum r + s carries, we have + Si > p for some i, whence, 
say, Ti > p/2. Then 



p M = T(r)- 1 P^ ...PP ...PI" 



is zero, since Pp is. 



□ 



Proposition 13.4.3. Suppose p is greater than or equal to twice the dimension of 
a module. Then the following relations must hold: Z m = [X m , Y m ] for every m, and 
X m Y n = Y n X m for every m^n. 



Proof. Consider the fundamental relation, equation 13.2.1 , applied to X m Y r 



V 1=1 ^ 



p m - 1\ (p m - 1\ n 
o [i 
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For every < / < p m there is clearly some carrying in computing the sum (p m — l) + l, 



so lemma 13.4.2 says that the summation term ZtnY^m^AX^mn is always zero, since 
at least one of Zm or K( p m_^ is zero. This gives Z m = [X m , Y m ] as claimed. 
Now let n 7^ m, and consider the fundamental relation applied to X m Y n : 



min(p"',p m ) 



/ > / p m _ A / p n _ A /A 

X m Y n = Y n X m + I \ / \ / \l) ^(')^0' n -0 J ^'(p m -0 

In case m < n, for every value of I in the above summation, (p n — I) + / has digit 
rollover, again forcing at least one of Zm or Yr p nn to be zero, forcing every term in 
the summation to be zero. A similar statement holds in case n < m. This proves 



X m Y n = Y n X m , as claimed. 



□ 



Thus far we have shown that, for p > 2d, every c?-dimensional module must satisfy 
at least those relations that representations of G x G x . . . over a field of characteristic 
zero must satisfy. We now show sufficiency 

Lemma 13.4.4. If p is greater than or equal to twice the dimension of a module, 
then for any r and s 

The same holds if we replace Z with XorY. 

Proof. In case the sum r + s does not carry, we know from our previous work with G a 
(or direct verification) that the equation is true, just by checking the assignments of 
the in terms of the Zi (This is true even without the hypothesis that p be large). 
If on the other hand the sum does carry, then the binomial coefficient on the right is 
zero by corollary |12. 3. 3 But so is the product on the left, by lemma 13.4.2 □ 



We can now prove 
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Theorem 13.4.5. Suppose p > 2d. Let X it Yi and Z i be a finite sequence of d 
matrices satisfying 

1. The Xi, Yi, and Z { are all nilpotent 

2. Zi = [Xi, Yi] for every i 

3. [Xi, Zi] = [Yi, Zi] = for every i 

4- For every i ^ j , X^Y^Zi all commute with Xj,Yj, Zj 
Let n = n m p m + n m _xV m X + • • • + n\p + n , and assign 

X {n) = T(n)- 1 X^...X^ 

and similarly for Y( n ) and Z^ . Set 

( c ..)(^>*) = Z {k) Y {m) X {n) 

Then these assignments define a valid d-dimensional representation of G over k. 
Proof. For arbitrary n, m, k, r, s and t, the equation we must verify is 




(n+r— l,m+s— l,k+t+l) 



which, with the given assignments and assumptions, can be written 

Z(k)Z( t )Y( m )X( n )Y( s )X( r ) 

min(n,s) , lS , ,\ ,\ 

^ ' n + r - / \ m + s - I \ k + t + A 
= ^ I r ){ m ){ k,t,l ) Z ^) Y (m + s-i)X< 
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Lemma 13.4.4 gives the identities 



k + t 



Z(k)Z(t) - \ f J ^ (/, + /) 

^(m)^M = ^ + m )Y {m+s _i ) 

( Tl ~\~ T — / 
X (n-l)X(r) = ( r )X {n+r _ l) 



so we can rewrite our equation as 



, - min(n,s) , 

k + t\ ^ S fk + t + l 

\Z [k+t) Y {m) X {n) Y (s) X {r) - I 1.1 

' 1=0 ^ ' ' 



Z{k+t+l)Y{m)Y{s-l)X( n -l)X( r ) 



First suppose that the sum k + t carries. In this case the equation is true, since 
the left hand side binomial coefficient vanishes, and the right hand side multinomial 
coefficient vanishes for every /, causing both sides to be zero. We assume then that 
k + t does not carry, so we can divide both sides by ( to yield 



min(ra,s) 



k + t + l 



Z(k+t)Y(m)X( n )Y(s)X(r) = y / J Z(k+t+l)Y( m )Y( s -l)X( n -t)X( r ) 

Now apply Z (fc+t+/) = Z( k+t) Z {i) : 

min(n,s) 

Z(k+t)Y( m )X( n yY( s )X( r ) = ^ Z(k+t)Z(l)Y(m)Y(s-l)X(n-l)X( r ) 

1=0 

We have Ztk+t) m the front and Xt r \ in the rear of both sides, so it suffices to show 



min(n,s) 

n—l) 



1=0 



and since Yt m \ commutes with Zn\, we can move it to the front of the right hand side, 
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and then take it off both sides, so it suffices to show 



mm(n,m) 

X(n)Y( m ) = Z(i)Y( m _i)X(, n _i) (13.4.1' 



1=0 



where we have replaced s with the more traditional m. 

Now we begin to replace the X',~s with their definitions in terms of the X^s, 



similarly for Y and Z, so that the left hand side of equation 13.4.1 is 

[r(n)r(m)] -1 X n ° . . . X^Y m ° . . . Y™ k 

and since everything commutes except Xi and Yj when i = j, we can write 

[T(n)T(m)]- 1 (X^Yr)...(X^ 



k / 



Moving all coefficients to the right, we must show 

min(n.m) 

(TO...(TO=r(n)r(m) Yl Z (D Y (m-i)X {n -i) 

1=0 

We proceed by induction on k, maximum number of p-digits of either m or n. If 
k = the equation is 

min(no; m o) 

X^Y mo = n lm \ Z (i) Y imo-i)X(n -i) 

1=0 

min(no,mo) . . 

E n !m ! yl V m -l yno-l 
(m» - nur,.* - mr° r ° 

1=0 

min(no,rrao) / \ / \ 

n 0\ ( m 0\ rl v mo-lvn -l 



(m Q -l)\(n -l)W 

£ " (?) (?) z '° Y r- xs- 



The reader may recall that this was exactly the equation to be verified halfway through 



the proof of theorem 13.3.1 in the characteristic zero case for X, Y and Z. Nowhere in 
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that section of the proof did we use the characteristic of the field; the same hypotheses 
hold here for X , Y and Z , and the proof goes through just the same, so we do not 
repeat it. Now suppose the equation is true when n and m have no more than k — 1 
digits. Let n = n k ^ip k ~ l + . . . + n and let n' = n k p k + n k _ip k ~ l + . . . + n , and 
similarly for m. Then by induction we have 



V{n')V{m')X {nl) Y {ml) = [(W) • • • W^Cf 1 )] ( X k" Y D 

min(n,m) \ / min(nj c ,rrtj c ) 



r(n)r(m) £ z (I) y (m _ x (B _ j ( £ z'l^j (™*Jzj^-''x fc " 

= n fe !r(n)m fc !r(m)^ 



¥11 fam-l)***"^ X (n-l)X n k k - 1 ' 



l>\ ) y (m k -l>)\ J \ (n k -l')\ 

Note that these divisions are valid, since for every value of I' in the summation, 
I' < m k ,n k < p. Note also that, since I < p^ 1 and V < p for all values of 1,1' in the 
summation, Lucas' theorem gives that ( l+ \ pk ) = 1 for all such I and I'. For similar 
reasons we have ((™-0+(m*-iV) = (M+^-'V) = L Then we have the identities 



n k W(n) = r(n') m fc !r(m) = r(m') 

z { i)Zi _ _fi + i' P k \ 7 _ 7 

( mfc _ /')! ~~ r (m-O r ((mfc-;')P fc ) - ( m _ J J Y {{m+m k pk)-(l+V)p k ) ~ Y (m'-(l+l'p")) 



and similarly 



n k -V 

V (n'-(l+Up k )) 



X {n-l)X k 



{n k -l')\ 

These substitutions transform the right hand side of our equation into 

= r(n')r(m / ) ^ Z(i + i, p k)Y( m ,_( l+ i, p k- j )X( n ,_( l+ i, p k)) 
i,i' 

But, if we look at the summation limits of I = . . . min(n, m) and V = . . . min(nfc, m k ), 
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we see that it is really a single summation running from to min(n', m'), with l + l'p k 
as the summation variable. That is 

min(ra',m') 

= T(n')T(m) ^ z (i) Y {m'-i)^(n'-l) 
1=0 

which finally gives 

min(n',m') 

X(n')Y(m') — ^ Z{l)Y(m'-l)X(n'-l) 
1=0 

as required. This completes the proof. 

□ 

13.5 Baker- Campbell-Hausdorff Formula For Hi in 
Positive Characteristic 

There is a much more compact way to state all of this. What we have really 
recovered is, in characteristic zero, the familiar Baker-Campbell-Hausdorff formula 
for the group H±, and for characteristic p » dimension, something very close to it. 

In theorem 3.1 of [10] it is proven that, if X,Y and Z are matrices over R such 
that Z = [X, Y] and [Z, X] = [Z, Y] = 0, then 

e x e Y = e x+Y+1 * z 

Our first step is to extend this result to the case of a field of sufficiently large char- 
acteristic when compared to the dimensions of the matrices X, Y and Z, and under 
the additional hypothesis that they be nilpotent. The proof we give is an almost 
exact replica of that given in \X0\ ; the only difference is that we replace the notion of 
derivative with 'formal derivative' of polynomials. 

For the remainder, by a polynomial f(t), we shall mean a polynomial in the com- 
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muting variable t with coefficients which are matrix expressions among the matrices 
X,Y and Z over a given field; for example, /(*) = XY + 2{Z - YX)t + \t 2 - t 3 . 
We define the formal derivative of f(t) in the usual manner; for example, f'(t) = 
2(Z — YX) +Yt — 3t 2 . Then the following facts hold just as well for formal differen- 
tiation as they do for standard differentiation. 

Lemma 13.5.1. Let f(t), g(t) be polynomials, and suppose that the field is either of 
characteristic zero, or of positive characteristic greater than the degrees of both f(t) 
and g(t). 

1. (product rule) (fg)' = f'g + fg' 

2. (uniqueness of antiderivatives) If f'(t) = g'{t), and if /(0) = g(0), then f(t) = 

g(t) 

3. (uniqueness of solutions to differential equations) Let M be some matrix expres- 
sion among X ,Y and Z. Then if fit) = Mf(t), and if g'(t) = Mg{t), and if 
f(0)=g(0), then f{t)=g{t) 

Remark: the assumption that char(/c) > degree is essential. For example, in 
characteristic 2, the derivatives of the polynomials t 2 and are both zero, and they 
are both zero when evaluated at t — 0, but they are obviously not themselves equal. 

Proof. 1. is true even without any hypothesis on the characteristic. Let / = ^^L akt k , 
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g = Y^k=o bkt , where the Oj, 6, are matrix expressions in X, Y and Z. Then 



(/</)'= E 6 ^ 

\fe=0 / \J=0 

2m / \ 

E E «*6i U r 

r=0 \k+l=r / 
m / \ 

= E r E K" 1 

r=0 \fc+/=r / 
2m / \ 

=E r E^ K -1 

r=l \fc+/= r / 
2m- 1 / 

= E( r + 1 ) E a ^) f 



r=0 



and 



a + /</ = E ka ^ x E 6 ^ + E akt 

\k=0 / \l=0 J \k=0 

m m 

k,l=0 k,l=0 
m 

k,l=0 

2m-l / \ 

= E E (k + l)a k bAf 

r=0 \k+l-l=r J 
2m-l / \ 

= E( r + 1 ) E a ^) f 

r=0 \fc+J=r+l / 

which proves 1. 

For 2., let / and g be as before. To say that /' = g' is to say that na n = 
nb n , (n — l)a„_i = (n — l)6 Tl _ 1 , . . . , ai = bi, and to say that /(0) = g(0) is to say that 
ao = &o- Under the given hypotheses all of n, n — 1, . . . , 1 are invertible, which forces 
On = b n , a n _i = . . . ai = b\ and ao = bo, whence / = g. This proves 2. 

For 3., suppose /' = Mf and g' = Mg. Then by matching coefficients for the 
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various powers of t this forces the equalities 



Ma n = Mb n = 

na n = Man-! nb n = Mb n _i 

(n - l)a n _i = Ma n - 2 (n - l)6 n -i = Mb n _ 2 



ai = Ma 61 = M6 

/(0) = g(0) again forces a = 60 ■ Noting again that all of n, n — 1, . . . , 1 are invertible, 
we can work backwards to see that ai = Ma = Mb = 61, that a 2 = ^Ma x = ^M&! = 
&2, • • •, a n = \Ma n _ x = \Mb n _ x = b n , whence / = g. This proves 3. 

□ 

Lemma 13.5.2. Let X and Y be commuting nilpotent matrices over a field k such 
that k is of characteristic zero, or of positive characteristic greater than or equal to 
the dimension of X and Y . Then 

1. (e tx )' = e tx X 

2. (e t2x ^j = e t2x (2tX) 

3. e x e Y = e x+Y 

Remark: the first two are obvious corollaries to the usual chain rule for differen- 
tiation, but the chain rule is in general not valid for polynomials in non-commuting 
coefficients. It is convenient for our purposes just to treat these cases separately. 

Proof. We note firstly that, if char(/c) = p > dim, then all of the above expressions 
make sense, since their series expansions will vanish before we get to see denominators 



227 



divisible by p. For 1., compute: 



fe«)' = [l+tX + t -^ + ... + t A 



2! " ' n! 

v 2tX 2 nt n ~ l X n 

X [1 + tX + ...+ 



(n — 1)1 n\ 

Xe tx 



Note that, in the second to last expression, we are justified in tacking on the term 
since multiplication by X will annihilate it anyway. This proves 1. 



For 2., compute again: 

t 4 X 2 t 2n X n ^' 



2! n\ 

At 3 X 2 2nt 2n ~ 1 X n 
2 « + _ + ... + __ 

2«|l + 2 ^ + ... + " (2< "~ 1 * 



2tX I 1 + t 2 X + ^f- + . . . + — '—— + 



2! n! 

t 4 X 2 t 2(n-l) X n-l jZnyn 



2! "' (n-1)! n! 

2tXe* 2A 



where, again, in the second to last expression, we are justified in tacking on the term 
t —^— since X will annihilate it anyhow. This proves 2. 

For 3., we shall prove that e tx e tY = e t<yX+Y ^ as polynomials; evaluating at t — 1 
gives the desired result. Note that the right hand side is defined; if X and Y commute, 
they can be put in simultaneous upper triangular form, and so X + Y is nilpotent. 



By 3. of lemma 13.5.1 since they are equal when evaluated at t — 0, it is enough to 
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show that they satisfy the same differential equation: 



and 



(e tx e tY )' = (e tx )' e tY + e tx (e tY )' 
= Xe tx e tY + e tx Ye tY 
= e tx e tY (X + Y) 

(J(X+Y)\> _ J(X+Y) 



)' = e « x+Y \X + Y) 



This completes the proof. 



□ 



Lemma 13.5.3. Let X and Y be nilpotent matrices over a field, commuting with 
their nilpotent commutator Z . If the field is either of characteristic zero, or of positive 
characteristic larger than twice the dimension of the matrices, then 



e x e Y = e x+Y+ ^ z 



Proof. We shall prove something stronger, namely that 



e tx e tY = e tx+tY+t -¥ z 



as polynomials; evaluating at t — 1 will give the desired result. 

We note first that if char = p > 2dim, all of the above expressions make sense, 
since e.g. the series expansion for e tx will vanish before we get to see denominators 
divisible by p. Note also that the results of the previous two lemmas apply, since the 
maximum degree of any of the above polynomials is 2dim — 2. 

Note also that tX + tY + must be also be nilpotent. If X,Y and Z are matrices 
satisfying the given hypotheses, they define a representation of H\ according to either 



theorem 13.3.1 or theorem 13.4.5 As any representation of a unipotent algebraic 
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group can be put in upper triangular form, it follows that X, Y and Z can be put in 
simultaneous upper triangular form. It is obvious then that any linear combination 
of X, Y and Z is nilpotent, and so the right hand side makes sense as well (except 
when p = 2; but this forces dimension to be < 1, and in this case the result is trivial). 

The proof proceeds exactly as in theorem 3.1 of [10] for the Lie group case. Since 
Z commutes with both X and Y, we can rewrite the above equation as 

e tX e tY e =f Z = e t(X+Y) 



Denote by A(t) the left hand side of this equation, B(t) the right hand side. These 



are both equal to 1 when evaluated at t = 0, so by 3. of lemma |13.5.1| it suffices to 
show that they both satisfy the same linear differential equation. Working first with 
A(t), using the iterated product rule we have 

A\t) = e tx Xe tY e=^ z + e tx e tY Ye=^ z + e tx e tY e=^ z (-tZ) 

= e tx e tY (e- tY Xe tY ) e^ z + e tx e tY Ye^ z + e tx e tY e^ z (-tZ) 

We claim that e~ tY Xe tY is equal to X + tZ. They are both equal to X when evaluated 



at t = 0, and (X + tZ)' = Z, so it suffices to show by part 2. of lemma 13.5.1 that 
the derivative of e~ tY Xe tY is equal to Z: 



(e- tY Xe tY )' = e- tY (-YX)e tY + e- tY XYe 
= e~ tY (XY -YX)e tY 

-tY ryJY 



tY 



Ze l 
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as required. Thus 



A'(t) = e tx e tY (X + tZ) e=^ z + e tx e tY Ye=^ z + e tx e tY e=^ z (-tZ) 



= e tx e tY e^ z (X + tZ + Y-tZ) 
= e tx e tY e^ z (X + Y) 
= A(t)(X + Y) 

To finish then, it suffices to show that B'(t) = B(t)(X + Y); but this is obvious by 



part 1. of lemma 13.5.2 This completes the proof. 

□ 

Theorem 13.5.4. Let M(x,y,z) be (the matrix formula for) a finite dimensional 
module for Hi in characteristic zero given by the nilpotent matrices X, Y , and Z , in 



the notation of theorem 13. 3.1 Then 



M(x,y,z) = e xX+yY+ ^ z - xy/2)z 

Proof. We will first prove that the formula given is actually a representation of Hi, 
which amounts to verifying the matrix equality 



e xX+yY+(z-xy/2)Z e rX+sY+{t-rs/2)Z _ e (x+r)X+(y+s)Y+(z+xs+t-(x+r)(y+s)/2)Z 



If X and Y are nilpotent and commute with their nilpotent commutator Z, then 
xX + yY and rX + sY are also nilpotent, and also commute with their nilpotent 



commutator (xs — yr)Z, and so lemma 13.5.3 applies: 



(xs — vr) 

exp(xX + yY)exp(rX + sY) = exp((x + r)X + (y + s)Y + ^—^Z) 



Recalling that e R e s = e R+s whenever R and S commute, the left hand side of our 
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first equation can be written 



exp(xX + yY)exp(rX + sY)exp((z + t — (xy + rs)/2)Z) 

XS — VT 

= exp((x + r)X + {y + s)Y H ^-Z)exp((z + t-(xy + rs)/2)Z) 

= exp((x + r)X + (y + s)Y + (z + xs + t - (x + r)(y + s)/2)Z) 

The expression given in the statement of the theorem is therefore indeed a representa- 
tion of Hi. To see that they are equal, simply verify that, in the notation of theorem 
the matrix (cy)^ 1 ' 0,0 ' is actually X, and the matrix (cijp ' 1 ' ' is actually Y; as 



13.3.1 



these completely determine the rest of the representation, we conclude that M(x, y, z) 
and the given expression are in fact equal. □ 

In the characteristic p > case, if we assume p > 2dim, we obtain a result 



analogous to theorem 12.3.1 for the group G a . 



Theorem 13.5.5. Let k have characteristic p > 0, and suppose p > 2d. Then every 
d- dimensional representation of Hi over k is of the form 



e xXo+yY +(z-xy/2)Zo e xPX 1 +yPY 1 + (zP-xPyP/2)Z 1 ^ X m +yP m Y m + (zP m -x" "V™ /2)Z m 



with all of the factors commuting. Further, any collection Xi,Yi,Zj of d- dimensional 



matrices satisfying the hypotheses of theorem \13.4-Q\ gives a representation according 
to the above formula. 



Proof. In the notation of theorem 13.4.5 let X , . . . , X s , Yq, . . . , Y s , Z , . . . , Z s be 
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given. Then the matrix formula for the representation they define is 



M{z,y,z) = (^) (n ' m ' k) x n y m z k = £ Z (k) Y {m) X (n) x n y 



' m ^x n y m z k 

n,m,k n,m,k 



m z k 



r(n)- 1 r(m)- 1 r(A;)- 1 Z fco . . . Z s Y mo . . . Y s ms X%° . . . X r s 



n,rn,k 



x no+nip+...+n s p s ymo+m 1 p+...+m s p s z ko+k!p+...k s p a 



p-l 

E 



1 



\no,mo,fco=0 

p-l 

E 



n \m \k \ 



Zq°Y™° Xq° x n ° y m ° z m ° 



1 



\n s ,in s ,k s =0 



n s \m s \k s \ 



gk s Y m s J^n s x "n s p s yiri s p s ^k s p 



y 



We note that, for fixed r, the matrices X r , Y r , and Z r , by theorem |13.4.5[ satisfy the 

we see 



hypotheses of lemma 13.5.3 Working through the proof of theorem 13.5.4 



that nowhere was the characteristic of the field used; only that lemma 13.5.3 



was 



satisfied. In other words, theorem 13.5.4 establishes a purely combinatorial fact that, 



if Xq, Yq, Z satisfy lemma 13.5.3, then 



E 



no,mo,fco=0 



n \m \k 



Z^Y Q mo X^x no y mo z mo = exp(xX + yY + (z - xy/2)Z ) 



whenever m is greater than or equal to the nilpotent orders of Xq, Yq, and Zq. We 
conclude that 



p-i 

E 

no, mo, fco =0 



1 



n \m \k 



Z^Y mo X^x no y mo z mo = exp(xX + yY + (z - xy/2)Z ) 



We can of course replace x, y and z with x p , y p and z p to likewise obtain 



E 



1 



n r \m r \k r \ 



z k rY rnr X nr x nr P r ymrP r z kr P r = exp ( x P r X r +y pT Y r + {z pr -X^y^ /2)Z r 
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for any r, and hence 



p- 1 

— i — m z o r A X V Z 
n \m \k \ 



\no,mo,fco=0 
p-1 



1 

g k sY m » x ns x nsP " v mspS z ksp3 



vn s ,m s ,fc s =0 



n s !m s !£; s ! 

= exp(:rX + yF + (* - xy/2)Z )exp(x p X 1 + y p Y x + - 

. . . exp(x pS X s + y pS Y s + - rr^V S /2)Z S ) 

which proves the theorem. Note that all of the factors commute, since so do X iy Y iy 
and Xj, Yj, Zj when i ^ j. 
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Chapter 14 



The Height-Restricted 
Ultraproduct 



In the previous chapters, we saw that for the unipotent groups G that we studied, 
the representation theories of G n in characteristic zero and for G in characteristic 
p » dimension are in perfect analogy. The appropriate context, we believe, in 
which to interpret these results is in consideration of the so-called height-restricted 
ultraproduct, which we formally define now. 

Let G be any of our so far studied unipotent groups, and let A; be a field of 
characteristic p > 0. They all have Hopf algebras isomorphic to k[x\, . . . ,x n ] for 
some n, so the following definition makes sense: 

Definition 14.0.1. The height of a representation M of G over k is the largest m 

„m — 1 

such that, for some i, x\ occurs as a coefficient in the matrix formula of M. In 
case no such occurs (i.e. M is a trivial representation), we say M has height zero. 

Since all of the Hopf algebras at issue are isomorphic to k[xi, . . . , x n ], height is an 



isomorphism invariant. Lemma |8.0.10| shows that applying any base change to the 
matrix formula of a representation yields two matrices, each of whose entries will be 
linear combinations of the entries of the other. 
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Example: the representation 



1 x + xP' 



\ 







1 



/ 



for G a has height 3. 

If p is large with respect to dimension, we know that every representation of G a 
or Hi can be factored into a commuting product of representations, each accounting 
for one of its Frobenius layers. In this case the height of M is equal to the number of 
these layers (even in the case of a trivial representation, which has no layers), hence 
the motivation for the definition. 

Let ki be a collection of fields of strictly increasing positive characteristic, Cj = 
Rep^a 

Definition 14.0.2. The height of an object [X;] of Y\ R Ci is defined to be the essential 
supremum of (height(X;) : % e /}. The height-restricted ultraproduct of the Cj, 

denoted n ff ^> * s the full subcategory of \\ R Ci consisting of those objects having finite 
height. For n G N, we denote by [] ff<ii Ci the full subcategory of Yl H Ci consisting of 
those objects of height no greater than n. 

Note that, as a subcategory of [j B Cj, we demand the objects of Y[ H Ci to ^ e °f 
bounded dimension as well as height. 

The remainder of this chapter is devoted to proving 

Theorem 14.0.6. Let G be any of the so far studied unipotent groups, ki a sequence 
of fields of strictly increasing positive characteristic. Then Y\ H Rep k .G is a neutral 
tannakian subcategory of Yl R Rep k .G, and is tensorially equivalent to Rep nuk .G°° . 
Likewise, for any n &N, Y\ H<n Rep ki G is tensorially equivalent to Rep Uuk .G n . 

Note that the groups G°° and G n in this theorem are independent of the choice 
of non-principal ultrafilter, and while the field Y\ u ki does indeed vary, it will in all 
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cases have characteristic zero. 

We shall prove the theorem for the group Hi, leaving it to the reader to convince 
himself that the same proof applies to the group G a . The proof is quite straightfor- 
ward; we shall construct an explicit equivalence between the two categories, show that 
it is tensor preserving, and it will be immediate that the usual forgetful functor on 
Rep nuki G°° can be identified as the restriction of the fibre functor uj : Y\ R Ci — > Vec nw ^ 
defined in chapter [7] to n^R-ePfe;^- 

For the remainder, let G denote the group Hi, ki a sequence of fields of strictly 
increasing positive characteristic, C, = Rep fc .G, and k = Ylu^i- 



14.1 Labelling the Objects of 

Now, what does an object tyl] of Y\ H Ci actually look like? Firstly, the vector spaces 
Vi are of bounded dimension. And since the ki are of strictly increasing characteristic, 



this tells us that, for all but finitely many i, Vi is of the form given by theorem 13.4.5 
that is, it is determined by a finite sequence of nilpotent transformations on V, 



^0) • • • ' ^-mi I ' • • • ' 1 mi ^0' • • ■ J 



according to the formula given in theorem 13.5.5 and under the conditions given in 



theorem |13.4.5| Secondly, it is of finite height, whence we can take m to be constant 
for almost every i. 

Theorem 14.1.1. Each object [V] of\\ H Ci is completely determined by a sequence 
Xq, . . . , X m , Y , . . . , Y m , Z , . . . , Z m of linear transformations on Y\ u Vi satisfying 

1. The Xj, Yj, and Zj are all nilpotent 

2. Zj = [Xj, Yj] for every j 

3. [X J ,Z 3 ] = [Y J ,Z J } = 0foreveryj 
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4- For every i j, the matrices Xi,Yi, Z^ commute with Xj, Yj, Z 3 



Further, any such sequence of linear transformations on a finite dimensional vector 
space over k gives an object ofY\ H Ci- 

Proof. We define these transformations as the ultraproduct of the transformations 
given above: 

= [Xq], X m = [X£J 
Yo = [Yq] ,Y m = [X l m ] 
z o = [Zq], ■ ■ ■ , Z m = [Z l m ] 



By theorem 6.2.9, all four of the above conditions are valid among the Xj, Yj and Zj 
if and only if, for almost every i, all four are valid among the Xj,YJ and Zj. Thus 
every object of Y\ H Ci determines such a collection of transformations on a ]^[ M /cj- vector 
space. 

Conversely, suppose that we are given a sequence X , . . . , X m , Y , . . . , Y m , Z , . . . , Z % 
of linear transformations on an n-dimensional fcj- vector space V satisfying all 
the above; we claim there is an object [Vi\ of flaA) unique up to isomorphism, to 



which these transformations correspond. By proposition 6.2.4 let Vi be a collection 
of n-dimensional fcj-vector spaces such that Y\ u Vi ~ V. By proposition 6.2.6, X is 
uniquely of the form [Xq], where each X l Q is a linear transformation on V^; the same 
goes for all of the X^, Y^ and Z^. Finally, note that given relations among the Xk, Y^ 
and Zk amount to a finite number of equations involving composition of maps, and 



so by theorem 6.2.9 these relations are almost everywhere valid among the XI, Y^ 



and Z\. As such, almost everywhere, they define a valid ifi-module structure on Vi 



according to theorem 13.4.5, The object we seek then is [Vj]. That [Vi] is unique 
up to isomorphism is clear from the description of morphisms in [] ff Cj given in the 
following paragraphs. □ 
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And what about morphisms? By definition, a morphism [fa] : [V^] — > \Wj\ in the 
category Y\ H Ci is such that, for almost every i, fa : — > Wi is a morphism in the 



category Cj. And by theorem |10.1.1[ for large enough i, such fa are exactly those 

this is 



6.2.9 



which commute with the Xj, YJ, and Zj for every j. Again by theorem 
equivalent to saying 

Theorem 14.1.2. Let V, W be objects ofY[ H Ci, given by (according to the previous 
theorem) the transformations Xj,Yj and Zj for V and Rj,Sj and Tj for W . Then 
= [(pi] is a morphism between V and W if and only if, for every j , <fr satisfies 

Xj O = O Rj Yj O = O Sj Zj O = O Tj 

We can therefore identify the category as the collection of all finite di- 

mensional vector spaces V over k = Yl u ^^ each endowed with a collection of linear 
transformations 

Xq, • • • , A m , Y , . . . , Y mi Z , . . . , Z m 



satisfying the relations given in theorem 14.1.1 , with morphisms being those linear 



maps commuting with the X's, Y's, and Z's. 



14.2 Labelling the Objects of Rep^G 00 



Let k = Yl u ki. What does the category Rep k G°° look like? By theorem 11.0.3 
representations of G°° on the A;-vector space V are exactly the finite commuting 
products of representations of G on V. And as k has characteristic zero, according 



to theorem 13.3.1, an individual representation of G on V is determined by a triple 
X,Y,Z of nilpotent linear transformations on V satisfying Z = [X,Y], XZ = ZX, 
YZ = ZY, and any such triple gives a representation. Thus, every object of Rep fc G°° 
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is a finite dimensional vector space V with an attached collection 



Xq, . . . , X m , Yq, . . . , Y m , Zq, . . . , Z m 

of nilpotent linear transformations on V, such that the representation determined by 
Xi,Yi,Zi commutes with the representation determined by Xj,Yj,Zj for i ^ j. By 



theorem 10.1.2, commutativity of these representations is equivalent to requiring that 



Xi,Yi, Zi all commute with Xj, Yj, Zj for i ^ j. Thus 

Theorem 14.2.1. Each object of Rep k G°° is a finite dimensional k-vector space V 
with an attached sequence X , . . . , X m , Y , . . . , Y m , Z , . . . , Z m of linear transforma- 
tions on V satisfying 

1. The Xj, Yj, and Zj are all nilpotent 

2. Zj = [Xj, Yj] for every j 

3. [X 3 ,Zj\ = [Y, J ,Z ] } = U for every j 

4- For every i ^ j , the matrices Xi,Yi, Zi commute with the matrices Xj, Yj, Zj 

Further, any such sequence of linear transformations on a finite dimensional vector 
space over k gives an object of Rep k G°° . 

Let V, W be objects of Rep fc G°°, given by the nilpotent transformations X , . . . , X m 
Y ,..., Y m , Z ,...,Z m and R , . . . , R m , S , . . . , S m , T ,...,T m respectively. What is 
a morphism between these two objects? By proposition 11. 1.3| it is a linear map 



: V — > W such that, for every j, <p is a morphism between the representations 



of G on V determined by Xj,Yj,Zj and Rj,Sj,Tj. And by theorem 10.1.1 this is 
equivalent to 

Theorem 14.2.2. Let V and W be objects of Rep k G°° , given by the transformations 
Xj, Yj and Zj for V and Rj, Sj and Tj for W . Then a linear map : V — >■ W is a 
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morphism between V and W if and only if, for every j, satisfies 



XjO(j) = (j)oRj Yj o (p = <p o Sj Zj o (p — (j) o T,j 



.1 



We can therefore identify the category Rep fc G°° as the collection of all finite dimen- 
sional vector spaces V over k, each endowed with a collection of linear transformations 

-Xo> • • • j X m , Yq, . . . , Y m , Zq, . . . , Z m 



satisfying the relations given in theorem 14.2.1 , with morphisms being those linear 



maps commuting with the X's, Y's, and Z's. 



14.3 The Equivalence H H d -> Rep^^G 00 

With the characterization for objects and morphisms in the categories J^f H Rep fci G 
and Repj-^.G 00 given in the previous two sections, the equivalence (as /c-linear abelian 
categories) is obvious. 

What is left to verify is the not quite obvious fact that this equivalence is tensor 
preserving. First, let us examine the tensor product on Rep^G 1 . Fix two objects V 
and W, given by the transformations X, Y, Z and R, S, T on V and W respectively. 
Then their tensor product has the matrix formula 

l^2(c ij ) (n ' m>k) x n y m z k J ® (^(^) (r ' s 'VyV 

\n,k,k / \r,s,t / 

= J2^ in ' m ' k) ® {di3) {r,s,t) x n+r y m+s z k+t 

n,m,k) r,s,t 

Recalling that (c^-)*™'" 1 '^ = n ^ M Z k Y m Z n and similarly for (dij), we see that the 
coefficient matrix for x in the representation V ® W is actually X <g) R, that for y is 



241 



Y <g) S, and for z is Z <8> T. 

Now consider two representations and W for G°° over k, given by the transfor- 



mations Xj, Yj, Zj and R~, Sj, T~ respectively. Proposition 11.1.2 tells that the 'layers 



for the tensor product of V and W is just the tensor product of the individual layers. 
That is 

Proposition 14.3.1. The tensor product of the representations V and W is given by 
the sequence of transformations Xj (g) Rj, Yj £g> Sj, Zj ® Tj. 

In positive characteristic the situation is slightly more delicate. Consider for 
example the natural representation of Hi 



( i \ 

1 x z 



i y 



\ 



1 



where we consider it as a representation of G in characteristic 2, that height- 1 

representation given by the matrices 



^010^ 








^ ^ 



Yn 



J 



V 



1 




V 



1 \ 


j 



The tensor product of this representation with itself is a 9 x 9 representation in which 
x 2 will occur as an entry, causing it to have height 2. We see then that, in general, 
the taking of tensor products in positive characteristic often causes Frobenius layers 
to 'spill over' into one another, and we do not always have a situation analogous to 



proposition 14.3.1 



However, because objects of f| H Rep fc .(j are demanded to be of bounded dimen- 
sion, for large enough % this difficulty will always vanish. 



242 



Proposition 14.3.2. Let k have characteristic p > 0, and let V and W be G-modules 
over k, say of height m, given by the transformations Xj,Yj, Zj and Rj, Sj, Tj. Then 
if p is large compared to the dimensions of both V and W , the representation V <8> W 
will also be of height m, given by the transformations 

X j ®R j ,Y j ®S j ,Z j ®T j 

for all j — 0, ... , m. 

Proof. We will first prove the theorem in the case where V and W contain a single, 
mutual non-zero Frobenius layer, say the r th layer, given by X, Y, Z and R, S, T; the 
case of an arbitrary number of layers is an easy corollary. We show that, if p is large 
compared to the dimensions of both, there is no possibility of that layer 'spilling over' 
into the next one. We can assume that p is large enough so that all of V, W, and 



y®V^ are of the form given by theorem 13.4.5, and in particular, that all the relevant 



matrices have nilpotent order < p/2. This means that we can write the representation 
V as 

P/2-1 1 

V -^^z k Y m X n x npr y mpr z kpr 
^— ' n'.m'.k'. 

n,m,k=0 

and W as 

p/2-l 



V — T c S b R a x apr y bpr z cpT 



a\b\c\ 

a,b,c=0 



and their tensor product as 
p/2-1 

V (Z k Y rn X n (8) T c S b R a )x {n+a)pr y {m+h)pr z {k+c)pr 

n,m,k,a,b,c=0 

Note that this new representation still only has a single non-zero Frobenius layer, the 
r th one, since, e.g., {n + a)p r < p r+l for every n and a in the summation. Notice also 
the coefficient matrix for the monomial x pr is exactly that for y pT is exactly 
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Y <g) S, and that for z pr is Z <g) T. Thus the theorem is true in the single layer case, 
and the case of an arbitrary number of layers easily follows. □ 

So, given two objects [V^],[Wj] of Y[ H ^Pkfi, for large enough % the previous 
proposition applies, whence the tensor products on J7 H Rep fci G and Rep n ^ ki G°° , via 
our equivalence, are compatible, and this equivalence is indeed tensor preserving. 

We leave it to the reader to convince himself that all of the arguments of this 
chapter can be slightly modified to prove 

Theorem 14.3.3. If ki is a sequence of fields of strictly increasing positive charac- 
teristic, then for any n G N, the category Y\ H < n ^ e PkiG is tensorially equivalent to 
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Chapter 15 

Height-Restricted Generic 
Cohomology 



One application of theorem 14.0.6 of the previous chapter is to give quick and 
intuitive large characteristic, 'height-restricted' generic cohomology results for the 
two unipotent groups we have studied, at least in the case of Ext 1 . 



15.1 First-Order Definability of Ext 1 

Let M and N be objects in some tannakian category over the field k, fix n, and 
let £1, . . . , Cm be a sequence of diagrams of the form 



£j : —t N —t Xj —t M —t 



To prevent ourselves from having to repeat the same long-winded sentence over and 
over again, we define the formula LISE(£i, . . . , £ m , M, N) to mean " £x, . . . , £ m is a 
linearly independent sequence of 1-fold extensions of M by iV." 

Theorem 15.1.1. For fixed m, the formula LISE(£i, . . . , £ m , M, N), modulo the the- 
ory of tannakian categories, is expressible as a first-order formula in the language of 
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abelian tensor categories. 

Proof. A full proof that LISE is a first-order formula would be an unnecessarily mind- 
numbing exercise; we shall instead be content to give an outline of such a proof, leaving 
it to the reader to fill in the necessary details. 

Define the formula Exten(iV, l, X, ir, M) to mean that these objects and morphisms 
comprise a 1-fold extension of M by N . This amounts to demanding that N, X, and 
M are objects, that i and n are morphisms, that the morphisms point between the 
objects we want them to, that i is injective, that 71 is surjective, and that the sequence 
is exact at X. This can be translated into a first-order sentence. 

Now the formula LISE(£i, . . . M, N) doesn't make sense on its face, since we 
are treating the extensions as if they were elements of our category, which they are 
not. If we were being strictly formal, we should instead use the objects and morphisms 
comprising the extensions as the variables, and make the additional assertions that 
they are all extensions of M by N. But since Exten is first-order, this can certainly 
be done, so we are justified in using this abbreviation. 

The formula LISE(£i, . . . , £ m , M, N) should go something like, . . . , £ m are ex- 
tensions of M by N, and for any scalars ki, . . . , k n , if k±^ © ... © k n ^ n is equivalent to 
the trivial extension, then k± — . . . — k n — 0." It is then simply a matter of showing 
that the concepts of being a scalar, of scalar multiplication of extensions, of Baer 
sum of extensions, of being a trivial extension, and of being the zero scalar are all 
first-order. 

The first is obvious; to be a scalar simply means that it is an endomorphism of 
the identity object, which is clearly first-order. If : X — > Y is a morphism and k a 
scalar, then we define the scalar multiplication of k on to be the composition 

X^l®X^l®Y^Y 
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which again is first-order. This allows us, under the definition given in section 2.4 
to define the scalar multiplication of an extension in a first-order way. (We would 
of course have to make separate definitions for the case when k = or k 7^ 0; but 
this is no problem, since u k is the zero scalar" and "£ is a trivial extension" are both 
first-order, as is shown below.) 

As for the Baer sum, we ask the reader to see the definition of it given in section 



2.4 It involves such concepts as "being a pullback" , "being the unique map pushing 
through a pullback", "being a cokernel", "being the unique map pushing through a 
cokernel", etc. All of these concepts are expressed in terms of universal properties, 
which are quite amenable to being expressed in a first-order fashion. They simply 
state that, given a collection of morphisms making such and such a diagram commute, 
there is a unique morphism making such and such a diagram commute. These types 
of statements are plainly first-order. 

The statement that two extensions are equivalent is first-order; it is merely the 
assertion that there exists a morphism making an equivalence diagram between the 
two extensions commute (this is the only point in the proof at which it is necessary 



to restrict to Ext 1 as opposed to higher Ext; see section 15.3 for more on this). Then 
to say that x is equivalent to the trivial extension would go something like "if £ is 
any extension, then £ © x is equivalent to £" . 

Finally, to say that the scalar k is the zero scalar is simply to say that it is the 
additive identity of the field End(l), which is clearly first-order. 

□ 

Corollary 15.1.2. For fixed n, the formula "dim Ext 1 (M, N) = n" is first- order. 

Proof. It is equivalent to "there exist n linear independent 1-fold extensions of M by 
N, and there do not exist n + 1 of them" . □ 
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15.2 Generic Cohomology for Ext 1 

In this section G denotes any unipotent group, defined over Z, for which the 



conclusion of theorem 14.0.6 is true (and in particular, has Hopf algebra isomorphic 
to A = Z[xi, . . . , x n ] for some n), ki is a sequence of fields of strictly increasing positive 
characteristic, Cj = Rep^G, and k is the ultraproduct of the fields ki. 

Definition 15.2.1. Let A; be a field of characteristic p > 0, M and N modules for G 
over k, and let n,h G N. Then we define 



Ext^ (M,N) 



to be the subset of Ext% k \(M,N) consisting of those (equivalence classes of) n-fold 
extensions of M by iV such that, up to equivalence, each of the extension modules 
can be taken to have height less than or equal to h. 

Example: the (equivalence class of) the extension 



-> k -> 



( 1 xP 2 ^ 



1 



->■ k -> 



is a member of Ext^^A;, k), but not of Ext^^Jk, k). 

Lemma 15.2.1. If M is a G-module of height no greater than h, then any submodule 
or quotient of M also has height no greater than h. If M and N have height no 
greater than h, so does M ® N . 



Proof. The case of subobjects and quotients follows immediately from lemma 8.0.10 
any subobject or quotient of M will have matrix formula with entries who are linear 
combinations of the entries of M. The case of M © N is even easier to see, examining 
the usual matrix representation for a direct sum. □ 

248 



Theorem 15.2.2. Let M and N be modules for G over a field k of characteristic p > 
0, of height no greater than h. Then Ex^ k JM,N) is a subspace of Ext^^(M, N) . 

Proof. Let £,% be extensions in Extj™^ (M,N), with the extension modules of both 
£ and \ having height no greater than h. We examine the definitions given for the 



Baer sum and scalar multiplications in section 2.4 Clearly a non-zero scalar multiple 



of either of them is still in Extg,^ (M, N) . As for the Baer sum £ © x, we recall the 
concrete constructions of a pullback or pushout of G-modules. The former is defined 
as a certain subobject of the direct sum of two modules, and the other a certain 
quotient of their direct sum. By the previous lemma both of these constructions yield 
modules of height no greater than those of the originals. Thus Ext^^ (M, N) is closed 
under the Baer sum. The trivial extension has extension module isomorphic to the 
direct sum of M and N, again by the previous lemma, of height no greater than that 
of M or N. □ 

Let M and iV be modules for G over Z. Then it of course makes sense to consider 
them as modules for G over any field. Further, for any n e N, and indeed for n = oo, 
we can consider them as modules for G n over any field, that is, as representations 



with a single layer, according to theorem 11.0.3 



Our goal for the rest of this section is to prove 

Theorem 15.2.3. Let ftGN, M, N modules for G over Z. Suppose that the compu- 
tation dim ExtQh^\(M, N) = m is the same for any characteristic zero field k. Let ki 
be a sequence of fields of strictly increasing characteristic. 

1. If m is finite, then for sufficiently large i 

dim ExtQ, k ,JM, N) — m 

2. If m = oo, then for any sequence of fields ki of strictly increasing characteristic, 
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dim ExtQ, k .JM , X) diverges to infinity with increasing i. 
For each i, suppose we have a diagram in 

C : -> Nt Xi Mi -> 

Denote by corresponding diagram in n w ^ : 

[C] ■■ -> [Nt] ^ [X] [Mi] 

Proposition 15.2.4. For each i, let . . . 6e i/ie sequence of diagrams in d 

£ I : -»■ JV< X/ -> Mj — )• 

£f : ->■ iVi -> Xf ->■ Mi -> 

T/ien i/ie formula LISE(Q, . . . , £ 2 m , Mj, Xj) /io/ds m almost every d if and only if the 
formula LISE([£], [£"], [M t ], [X]) holds in fLA 



Proof. Apply theorems 15.1.1 and CO. 15 □ 



Now fix two modules M and X for G over Z, and for each i, let . . . , £|" be the 
sequence of diagrams in Cj 

£ • : -> X -> X/ -> M -> 

£f : ->• X ->• X™ ->■ M -> 

Further, assume that each £| is a member of Ext^.^M, X); this is merely the asser- 
tion that every X\ has height no greater than h. As M and X are constant over i and 
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have height 1, each of [M], [X], and [Xf] have bounded height, whence = [£{] is 
an extension in [j H<ji Ci. 

Proposition 15.2.5. For fixed n, h G N and modules M and X for G over Z, 
the statement <e dimEx^ k JM,N) > n" /io/ds /or almost every i if and only if 
d i mExt 1 Gh(nukt) (M,N)>n. 

Proof. Suppose dim Ext^^M, X) > n holds for almost every i. This means that, 
for almost every i, we have a linearly independent sequence of 1-fold extensions of M 
by X 

£ I : -> iV -> X/ -> M -> 



£f : -> X -> X™ ->■ M ->■ 

with each X| being of height < ft,. Note that the objects [X,] 1 , . . . , [Xj] m are of 
bounded height and dimension; then these project to the sequence of diagrams 
. . . , [£i] m in riiKh^i- A s the formula LISE is first-order, these extensions, con- 
sidered as diagrams in the full ultracategory fTw^' are a ^ so linearly independent, and 



by proposition 5.0.4, so also are they in the undercategory n^^Cj- Note also that, 
under the equivalence [j fl<Ji Ci ~ Rep Uuki G h given in section 14.3 the objects [M] 



and [X] in [] H<)i C i actually correspond to the objects M and X in Kep u ^ ki G h . This 
gives a collection of m linearly independent extensions of X by M in the category 

He Pn u k i Gh ; thus > dimExt GHn w fc i )( M > iV ) ^ n - 

The converse is proved similarly; if dim ExtQh^^^^iW, X) > n, take a linearly 

independent sequence of extensions . . . , [£j] n of X by M in riff<h^> which project 

back to, for almost i, a linearly independent sequence of extensions £f of M 

by X in d, showing dim EiXb^,, k JM, X) > n for almost every i. □ 
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Corollary 15.2.6. The statement dim Ext^ h ^ k ^(M, N) = n holds if and only if 



dim Ext. 



Q, k a(M, N) = n holds for almost every i. 

Proof. The above statement is equivalent to the conjunction "dim Ext^^ fc .)(M, N) > 
n" and NOT "dim Ext^^ ^{M, N) > n + 1". Apply the previous proposition. □ 



We can now prove theorem 



15.2.3 



Suppose that the computation dim Ext^^M, N) 
n is both finite and the same for any characteristic zero field k. Then in particular, 
it is the same for the field Yl u ki for any choice of non-principal ultrafilter. Let J c I 



l,h 



be the set on which dim Ext^ JM, N) = n is true. By the previous corollary, J is 



large for every choice of non-principal ultrafilter, and by corollary |B.0.8[ J is cofinite. 
This proves the first part of the theorem. 

If instead dim Exk Gh s k \(M, N) is infinite, then for any n £ N, the statement 



dim Ext 



l,h 
G(ki) 



n is false for almost every i, for every choice of non-principal ul- 



trafilter, whence the statement dim Ext 



l,h 
G(ki) 



n is false on a cofinite set. This goes 



for every n G N, whence dim Ext^ ^ is divergent. 



15.2.1 An Example 



We shall illustrate an application of theorem 15.2.3 with a simple, easily verifiable 
example. 

Let G = G a and consider Ext^^/c, k), where k has characteristic p > 0. As 
the extension module of any extension of k by k has dimension 2, and as p > 2 for 



all primes, theorem 12.3.1 applies, whence any 2-dimensional representation of G a is 
given by a finite sequence Xq, . . . ,X m of commuting nilpotent matrices over k. We 
can take X to be in Jordan form times some scalar, namely 



c 
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for some scalar cq. The centralizers of this matrix are exactly those of the form 



( h \ 

a o 



i a j 



and if we demand them to be nilpotent, we must have a = 0. Thus the Xi can be 
taken to be 



c 



i • • • i X m 



c r , 







/ 



for some scalars Cq, . . . ,c m . The representation they generate according to theorem 
I12.3.1l is 

/ \ 

1 CqX + CiX p + . . . + c m x p 



\ 







/ 



These are all extensions of k by k with the obvious injection 1 i — >■ (1,0) and projection 
(1, 0) i — y 0, (0, 1) i — y 1, and any extension of k by k must be of this form. Therefore 
extensions of the form 



1 CqX + C\X V + . . . + C m X p 

^ k^ | |^A;-^0 
1 



constitute all extensions of k by k. Denote by £ m the extension 



£ m : — >• A; — >• 



1 Tl m ^ 

1 x p 



1 



->• A; -4 



Then direct computation shows that the Baer sum of £ m and £ n is the extension 



£m © £n : y k y 



( 1 x pm +x P n ^ 
1 



-> fe ->■ 
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and that, for c 7^ 0, the scalar multiplication c£ m is (equivalent to) 



c£ m : ->■ k -> 



1 o m 
1 CX P 



i° 1 y 



k -> 



A basis for Extgw^A:, A;) is therefore given by .... If instead we restrict to 



Exhb, ,,Jk, k), then this is a finite dimensional subspace spanned by £0, • • • , 61-1 • 



Now consider Exk G ht k \(k, k), where k now has characteristic zero. Using theorems 
12.2.1 and 11. 0.3| virtually identical computations to the above show it to be spanned 
by the linearly independent extensions xo, ■ ■ ■ , Xh-u given by 



J. Xrr?. 



V 



1 



-> fc -> 



/ 



where x m denotes the m th free variable of the Hopf algebra fc[xo, • • • , Xh-i], an d that 
the Baer sum and scalar multiplication of extensions give analogous results to that 
of the above. We see then that 



dim Extg,^ (A;, k) = dim Ext^^A/, k') 



when k has characteristic p and k' has characteristic zero. In particular we conclude 
that, if ki is a sequence of fields of increasing positive characteristic, then 



dim Ext 



l,h 

G a {ki) 



(ki, ki 



dimExt^ (nwfci) (n^,n fc i) 



which is predicted by theorem 15.2.3 



The reader should note that this example is misleading, in that the generic value 
of EiXtQ h ( k Jk,k) is attained for any positive characteristic p > 2. This was simply 



due to the fact that theorem 12.3.6 applies to all characteristics in dimension 2, 
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i.e. because any 2x2 nilpotent matrix is nilpotent of order < 2. If instead we were 
to consider Ext^ (/^ k^ki © fcj), then (assuming the computation on the right 
does not depend on the particular characteristic zero field) we would still have 

dim ExtJ£ (fci) (fci © fci, fci © fcj) > dim Ext^^dl&i © U k h U k i © 11**) 

u u u u 

only this time we would have to wait for char(/cj) = 5 for the generic value to be 
obtained. 

15.3 The Difficulty with Higher Ext 

To finish, we mention a few of the reasons why our attempts to apply this ma- 
chinery to Ext n for n > 1 have so far proved unfruitful. 

In the previous section we saw that there is a 1 — 1 correspondence between 
extensions in Ext^^^M, N) and almost everywhere extensions in Ext^.^M, N). 
But for higher Ext, this does not always work. Here is what can go wrong. For 
concreteness' sake consider Ext^.^M, N), and suppose that, for each i, we have an 
element & G Ext^.-^M, N). This means that each & is of the form 

Zi-.O-t M -> Xi->Yi-> N ->Q 

with every Xi and Y{ being, up to equivalence of extensions, of height < h. But this 
says nothing about the dimensions of Xi and Yi, and indeed there is every reason 
to suspect that dim(JQ) and dim(l^) diverge as % becomes large. As the objects of 
[\ H<h Rep kt G are demanded to have bounded dimension as well as height, the objects 
[Xi] and [Yi] will not belong to Y[ H <h^ e PkiG, and hence the extension e Ylu^Pkfi 
will not belong to Yl H < h Rep k .G. 

Another problem we face in the case of higher Ext is in trying to define equivalence 
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of extensions in a first-order way. For Ext 1 this was no problem, since if two 1-fold ex- 
tensions are equivalent, there is necessarily an actual equivalence map between them, 
which is easily asserted in a first-order way. But this is not so for rz-fold extensions in 
general; equivalent extensions need not have an actual equivalence mapping between 



them (see section 2.4). 

To illustrate the problem, suppose we have, for each i, two equivalent extensions 
£i and Xi m the category d. What this says is that, for each i, there exists a finite 
sequence of rrii extensions pj, pf, . . . , p™* forming a chain of concrete equivalencies 
leading from £j to Xi- But there is every reason to suspect that m 8 diverges to infinity 
as % becomes large. As such, these equivalencies between £j and Xi i n the categories 
Ci do not necessarily project to an equivalence between the extensions and [xi] in 
the category H^Cj. If such a collection were found (and we have none in mind), this 
would in fact prove that the property of being equivalent is not first-order. 

With these difficulties in mind, we tried instead to prove the following inequality: 

Theorem 15.3.1. Let n,h GN, and let M and N be modules for G over Z. Suppose 
that the computation dim ExtQ h ^(M, N) = m (where m could possibly be infinite) is 
the same for every characteristic zero field k. Then for any sequence of fields hi of 
increasing positive characteristic 



dimExt n ^ h {ki) (M,N) > m 



for all sufficiently large i. 

But the obvious attempt at a proof of this falls apart as well. Suppose we had a se- 
quence [Ci] 1 , ■ ■ ■ , [£i] n of linearly independent extensions in the category Y[ H<h ^p k .G] 
then we would like to see that these project back to an almost everywhere sequence 
of linearly independent extensions . . . in the categories Q. But even this, as 
far as we can tell, is not guaranteed. Linear independence means that, whenever 
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a\£} © ... © a™£™ is a trivial extension, then a] — . . . — a™ — 0. Being a trivial 
extension in turn means that, whenever x% is an y extension, x% © © • • • © 
is equivalent to x%- But again, equivalence of extensions is not necessarily first-order, 
and so neither is the property of being a trivial extension. We see then that a linear 
dependence among the Q does not necessarily project to a linear dependence 

among the . . . , and we cannot automatically conclude that the a], . . . , a" 
are equal to zero for almost every i. 
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Appendix A 

Model Theory and First-Order 
Languages 

Here is a very basic and sometimes imprecise introduction to the notions of models 
and first-order languages; it will be just enough to get by. The reader is encouraged 
to consult [H] for an excellent introduction to the subject. 

We abandon the term 'model' for the moment and instead focus on the notion 
of relational structure. This is by definition a set X (called the domain of the 
structure) endowed with the following: a collection of rij-ary functions on X 
(functions from X ni to X, where rij G N), a collection {rj} of rij-ary relations on X 
(subsets of X" J ) , and a collection of 'constants' {c^}, certain distinguished elements 
of the domain. The various labels given to these functions, relations and constants is 
called the signature of the structure. Many (but not all) of the usual mathematical 
structures one comes across can be realized as relational structures. We do not at all 
demand that a signature be finite, but all of the examples given in this dissertation 
will have finite signatures. 

Example: a field k can be realized as a relational structure. A natural choice for 
signature might be the two binary functions + and *, the unary function — , and the 
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two constants and 1, representing the obvious. We abusively call this the 'signature 
of fields', realizing that a random structure in this signature is not at all guaranteed 
to be a field. 

The whole point of bothering with which symbols you choose to attach to a rela- 
tional structure is three-fold. Firstly, it determines the definition of a 'homomorphism' 
of relational structures (always assumed to be between structures in the same signa- 
ture); namely, a homomorphism is demanded to preserve relations, functions, and 
send constants to constants. Secondly, it determines the notion of a 'substructure' A 
of a structure B, which by definition must contain all constants, be closed under all 
functions, and such that the relations on A are compatible with those on B. Note 
for instance that we included the symbol — in the language of fields, whence any 
substructure of a field must be closed under negation. If we were to omit this symbol, 
this would no longer be the case; e.g. N would now qualify as a substructure of Q. 

Thirdly, and most importantly for us, the signature of a structure determines 
the structure's first-order language. Roughly speaking, the first-order language 
of a structure is the collection of all meaningful 'formulae' one can form, in certain 
prescribed ways, using the symbols of the signature as the primitive elements of the 
language. 

Any language, at the least, needs certain primitive verbs and nouns. In the context 
of first-order languages verbs are called predicates and nouns are called terms. For 
a given signature we define the terms of our language as follows: 

1. Any variable is a term (a variable is any convenient symbol you might choose 
not being used by the language already, e.g. x, y, a, b, etc.) 

2. Any constant symbol is a term 

3. If / is an n-ary function symbol in the language and t 1 , . . . ,t n are terms, then 
so is /(*!, . . .,t n ). 
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In the case of fields, 1 is a term, so is x, so is * x, and so is (x + y)*(l + (—z)). 
These represent the 'nouns' of our language. 

Next we need predicates, ways to say stuff about our nouns. This is the role 
fulfilled by the relational symbols of our language, as well as the binary relational 
symbol '=', representing equality, which we always reserve for ourselves. We define 
the atomic formulae, which one can think of as the most basic sentences belonging 
to our language, as follows: 

1. If s and t are terms, then s = t is an atomic formula. 

2. If r is an n-ary relational symbol, and t±, . . . , t n are terms, then r(t±, . . . ,t n ) is 
an atomic formula. 

In the case of fields, 1 = is an atomic formula, and so is x + y = 1 * z. The 
signature we chose for fields did not include any relational symbols other than '=', 
so all atomic formulae in this signature must be built from this. 

We are of course not content to restrict ourselves to these primitive formulae; we 
want to able to put them together using the usual logical symbols. Our primitive log- 
ical symbols are A, V, and -i, representing 'and', 'or', and 'not'. Thus, the following 
are all formulae in the first-order language of fields: — >(1 = 0) A x = y, -i(l + x = x), 
and -.(1 = 0) A -.(1 + 1 = 0) A -.(1 + 1 + 1 = 0) A -.(1 + 1 + 1 + 1 = 0). In 
higher order languages, there is indeed a notion of conjunction or disjunction of an 
infinite collection of formulae, but the definition of a first-order language explicitly 
disallows this. All logical combinations of formulae take place over finite collections 
of formulae. 

We finally have two more symbols, namely V and 3, representing universal and 
existential quantification. For any formulae $ in our language, and any variable x, 
we also have the formula Vx$ and 3x&. So, for example, in the language of fields, the 
following are formulae: V:r(0 * x — 0), (Vx)(Vy)(x *y = y * x), and -<(3x)(x * = 1). 
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It is important to remember that quantification is always understood to be over 
the elements of a structure; in particular, we have no concept in first-order logic of 
quantification over subsets of a structure. 

To make things manageable, we shall not hesitate to use abbreviations. For two 
formulae $ and $ ==>- \& is shorthand for \I/ V and $ •<=>■ \I/ is shorthand 
for ($ ==>- ^) A ==>- $). If x is a variable and $(x) is a formula in which the 
free variable (unbound by quantification) x occurs, then (3!x)$(x) is shorthand for 
(3x)($(x) A (Vy) ($(?/) ==>• x = y)). We shall be making several such abbreviations 
as we go along, and usually leave it the reader to convince himself that the intended 
meaning can be achieved using only the primitive symbols of our language. 

We say that a first-order formula is a sentence if it has no free variables. A 
(perhaps infinite) collection of sentences in a given first-order language is called a 
theory. If M is a relational structure and $ = {(fti : i G /} is a theory, we say that 
M is a model of $ if every sentence of $ is true in the structure M. Obviously not 
all collections of sentences have models; {1 = 0, — i(l = 0)} obviously has no model, 
whatever you interpret and 1 to be. 

Theorem A. 0.2. (Compactness theorem for first- order logic) Let $ be a collection 
of first- order sentences such that every finite subset of $ has a model. Then $ has a 
model. 

Proof. See theorem 5.1.1 of [TT]. □ 

Some oft used corollaries: 
Proposition A. 0.3. The following are all corollaries of the compactness theorem. 

1. If the first-order sentence (ft is equivalent to the infinite conjunction of the first- 
order sentences {ipi : i 6 J}, then (ft is equivalent to some finite conjunction of 
them. 
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2. If the first-order sentence <p is implied by the infinite conjunction of the first- 
order sentences {ipi : i 6 /}, then is implied by some finite conjunction of 
them. 



As an easy example of an application of compactness, let +, *, — , 0, 1 be the lan- 
guage of fields. See proposition 6.1.1| for the fairly obvious observation that "is a 



field" is expressible by a first-order sentence in this language. 
Proposition A. 0.4. Let L be the language of fields. 

1. The statement "has characteristic zero" , modulo the theory of fields, is not 
expressible by a first- order sentence of L. 

2. If <p is a first-order L-sentence which is true of every characteristic zero field, 
then <p is true for all fields of sufficiently large positive characteristic. 

Proof. For a fixed prime number p, define char p to be the first-order sentence 1 + 
1 + . . . + 1 = (p-occurrences of 1). Modulo the theory of fields, this is obviously 
equivalent to the assertion that the field is of characteristic p. Now the statement 
"is of characteristic zero" is by definition equivalent to the infinite conjunction of the 



sentences -ichar p for p — 2, 3, 5, By 1. of proposition A. 0.3 if this were expressible 

as a first-order sentence, it would be equivalent to some finite subset of this collection. 
But we know this is absurd; no finite collection of the sentences -ichar p can guarantee 
a field to be of characteristic zero. We conclude that "is of characteristic zero" is not 
first-order. 

Now suppose that the first-order sentence <j) were true in every characteristic zero 
field. This means that the infinite conjunction of the sentences -ichar p implies 0. By 



2. of proposition |A.0.3l <p is implied by some finite subset of them. Any field of large 
enough positive characteristic satisfies this finite collection of sentences, and hence 
satisfies d> as well. □ 
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Appendix B 



Ultrafilters 



The notion of a filter is sometimes given as a slightly more general definition then 
we give, but it suffices for our purposes. 

Definition B.0.1. Let / be a set. A filter on / is a non-empty collection T of 
subsets of / satisfying 

1. J 7 is closed under the taking of pairwise intersections 

2. If Y is a superset of some element of J 7 , then Y is in J 7 

3. The empty set is not in J 7 

A filter is called an ultrafilter if it is maximal with respect to inclusion among 
all filters. An ultrafilter is called principal if it is of the form {X C / : x G X} for 
some element x oi I. 

We sometimes call the elements of a filter large sets. If <f>(i) is some statement 
about elements of I we say that holds almost everywhere or for almost every 
i if the set on which <p(i) is true is large. 

Proposition B.0.5. A filter J 7 on I is an ultrafilter if and only if for any subset X 
of I, either X or its complement is in T . 
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Proof. Suppose J 7 does not contain X or its complement; we claim that J 7 can be 
enlarged to a new filter containing one or the other. First suppose that R, S G J 7 are 
such that X n R = and X c n S = 0. Then # C X c and 5 C X, whence Rn S = 0; 
but this cannot happen since T is closed under intersections and does not contain 0. 
Thus at least one of X or its complement is not disjoint from anything in J 7 , let's say 
X. Then define 7' = 7 U {S C / : S D X} U {S n R : R G J 7 , S D X}, which is 
easily seen to be a new filter properly containing J 7 . 

Conversely, if J 7 contains every set or its complement, then it is necessarily max- 
imal, since there are no new sets we can throw in; any such X would intersect with 
its complement to arrive at G J 7 . □ 

Examples: The collection of all subsets having Lebesgue measure 1 is a filter on 
the interval [0, 1], and the collection of all cofinite subsets is a filter on N. These are 
both obviously non-principal and non-ultra. 

Principal ultrafilters are boring and useless; we need non-principal ultrafilters. 

Proposition B.0.6. An ultrafilter U is non-principal if and only if it contains no 
finite sets if and only if it contains no singleton sets. 

Proof If U is principal, say generated by x G /, then obviously U contains the 
singleton set {x}. Conversely, suppose U contains the finite set X = {x±, . . . ,x n }, 
n > 1. Then at least one of the sets {x±, . . . ,x n _i} or its complement is in U. In 
the latter case we intersect with X to obtain {x n } G U, and in either case we have a 
new subset with less than n elements. Applying this process finitely many times will 
eventually yield some singleton {x} in U. Then any subset containing x is in U, no 
subset not containing x can be in U, and thus U is principal. □ 

Proposition B.0.7. Let I be an infinite set, and X C I any infinite subset of I. 
Then there exists a non-principal ultrafilter on I containing X . 
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Proof. Let C be the filter on I consisting of all cofinite sets, and enlarge it, as in the 



proof of proposition B.0.5 to contain X. Partially order the collection of all filters 
containing X by inclusion, which we just showed is non-empty. The union over any 
chain of filters qualifies as an upper bound for that chain; take a maximal element by 
Zorn's Lemma. It is guaranteed to be non-principal since it contains no finite sets, 



by proposition B.0.6 □ 



Corollary B.0.8. The subsets of I that are contained in every non-principal ultra- 
filter are exactly the cofinite subsets. 

Proof. If X is not cofinite, the previous proposition shows that its complement is 
contained in some non-principal ultrafilter, necessarily not containing X. If X is 



cofinite, then proposition |B.0.6| shows that every non-principal ultrafilter does not 
contain X° , and so contains X. □ 

Lemma B.0.9. Let U be an ultrafilter on I, J a member of U, and X\, . . . ,X n a 

finite collection of subsets of I which cover J. Then at least one of the Xi is inlA. 

Proof. Suppose none of them are in U. Then all of their complements are in U, as well 
as the intersection of their complements, which is contained in J c ; but this cannot 
be, since J c U. □ 

Lemma B.0.10. If U is an ultrafilter on I , J a member ofU, and X\, . . . ,X n a 

finite disjoint partition of J, then exactly one of the Xj is contained in IA. 

Proof. At least one of them is in U by the previous lemma, and no two of them can 
be, lest we take their intersection and arrive at e IA. □ 



With a view towards defining ultraproducts in the next section, we close with 

Proposition B.0.11. Let Xj_ be a collection of sets indexed by I, U an ultrafilter on 
I. Define a relation on Y\ i&I Xi (cartesian product of the Xi) as follows. For tuples 
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(xi), (yi) G H ieI Xi, (xi) ~ (yi) if and only if the set {i G I : x { = is in U. Then 
~ is an equivalence relation. 

Proof. Reflexivity is clear since necessarily I G J 7 , and symmetry is obvious. For 
transitivity, suppose (xi) ~ {yi) and {yi) ~ (zi). Then the set 

{i E I : Xi = 



contains at least the set 

{i £ I : Xi = yi} (~) {i £ I : yi = z { } 



which is in U by intersection closure. Then so is {i G / : Xi = z^}, by superset 
closure. □ 

We say that two such tuples are equal almost everywhere or on a large set if 

they are related through this relation, and we denote by [x^ the equivalence class of 
the tuple (xj). 
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Appendix C 



Ultraproducts 



Let Mj be a collection of relational structures in a common signature L, indexed by 
the set J, and fix a non-principal ultrafilter U on I. Then we define the ultraproduct 
of these structures relative to U, denoted M = Yi u Mi, to be a new L-structure defined 
as follows. 

The domain of M is the collection of all equivalence classes [xi] of tuples (xi) G 
n igi -Mj (cartesian product of the Mj) as defined in proposition B.0.11 For an n- 



ary relation symbol r, we define r([xj]i, . . . , [xi] n ) to hold if and only if, for almost 
every i, r(xj j i, . . . ,Xi >n ) holds in the structure Mj. For an n-ary function symbol /, 
f([xi]i, . . . , [xi] n ) is the element [/(a^i, . . . , x^ n )} of M, and the constant c corresponds 
to the element [xj], where X{ is the element of Xi corresponding to the constant c. 

Proposition CO. 12. For any ultrafilter U , the definition just given for M = [j^Mj 
is well-defined. 

Proof. We must show that the definitions given are independent of the choice of tu- 
ple (xi) one uses to represent the equivalence class [xi\. Suppose then that (xj)i ~ 
(y i ) 1 , . . . ,(xi) n ~ (yi) n , with x itl = y ifl holding on the large set J x , similarly for 
J2, . . . , J n . Let r be an n-ary relational symbol, and suppose that the relation 
r(xi t %, . . . , Xj n ) holds for almost every i, say on the large set J C I. Then the relation 
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r(y it i, . . . , y i>n ) holds at least on the set J\ D . . . fl J n fl J, which is large by intersection 
closure. Thus deciding if r([xj]i, . . . , [xj] n ) holds in M is independent of the choice of 
representatives. Identical arguments hold for function and constant symbols. □ 

Since we are assumed to be working over an ultrafilter, we can say something 
stronger: 

Proposition CO. 13. The relation r([xi]i, . . . , [xi] n ) does not hold in M if and only 
if, for almost every i, r(xi t i, . . . , x^ n ) does not hold in Mi. 

Proof. The 'if direction is true even in a non-ultra filter. For the converse, If 
r([xj]i, . . . , [xi] n ) does not hold, it is because the set on which r(x itl , . . . ,x i>n ) holds 
is not large. Then as U is an ultrafilter, its complement is large, namely the set on 
which r(xi t i, . . . , x^ n ) does not hold. □ 

This is the reason we demand our filters to be ultra; otherwise M preserves the 
primitive relations r i; but not necessarily their negations. The reason we demand our 
ultrafilters to be non-principal is because 

Proposition CO. 14. IfU is a principal ultrafilter, say generated by j G /, then M 
is isomorphic to M y 

Proof. Two tuples (xj), (yi) are then equivalent if and only if the set on which they 
are equal contains j, if and only if Xj = y r The map [xi] t-> Xj is thus easily seen to 
be an isomorphism of L- structures, preserving all relations and whatnot. □ 

The 'fundamental theorem of ultraproducts', what makes them worth studying at 
all, would have to be 

Theorem CO. 15. (Los' Theorem) Let U be an ultrafilter on I, M = Ylu^-i the 

ultraproduct of the structures M; L with respect to U. Let $(xi, . . . , x n ) be a first- 
order formula in the language L in the variables x±, . . . , x n , and let [aj]i, . . . , [aj] n be 
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a collection of elements of M. Then $([aj]i, . . . , [aj] n ) is true of M if and only if 
$(aj : i, . . . , a^ n ) is true of Mi for almost every i. 

Proof. See theorem 8.5.3 of [TT]. □ 

We've proved this theorem already in the case of atomic formulae or their nega- 
tions. The rest of the proof proceeds by induction on the complexity (i.e. length) of 
the formula. For example, if the theorem is true for the formulae . . . ,x n ) and 

ty(yi, . . . , y m ) then it is also true for their conjunction, by considering the intersection 
of two large sets, which is also large. 

Corollary CO. 16. If $ is a first- order statement in the language L, then $ is true 
of [jyMj if and only if it is true of almost every i. 

Proof. Sentences are just a particular type of formulae; apply Los' theorem. □ 

Ultimately, we are not particularly interested in what sorts of statements might 
hold in riw^ f° r a particular choice of non-principal ultrafilter, but rather those 
first-order statements that hold for every non-principal ultrafilter. 

Proposition CO. 17. Let $ be a first-order statement that holds in Y[ u Mi for every 
choice of non-principal ultrafilter on I. Then $ holds in Mi for all but finitely many 
% G /. 



Proof. Apply corollary CO. 16 and corollary B.0.8 □ 
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